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Editor's Preface 


The revolution in the teaching of mathematics at schools and colleges 
is now well established. Not only have the methods of teaching changed 
considerably, but so have the topics which are taught. This is as it 
should be—the needs of our future mathematicians and scientists are 
constantly changing in the light of the technical developments of the 
twentieth century. It seems likely that the frontiers of science will be 
pushed further and further back in the coming years and the mathe¬ 
matics required must certainly keep abreast of these changes. If these 
developments are to continue rapidly, it is necessary for the students 
to meet and understand new ideas whilst in their last years at school 
and their first at college and university. 

The ‘Contemporary Mathematics’ series of books and booklets has 
been written after three years’ thought and experiment, and has been 
designed for these major purposes: 

1. To produce texts to cover the new requirements of the major 
school mathematics projects (‘A’ and ‘S’ level in Britain). Texts 
will provide the mathematical developments and cover the 
important applications (e.g. Matrices: Pure and Applied). 

2. To provide an introduction to topics which are both contemporary 
and important, showing also how these might best be taught (e.g. 
An Elementary Introduction to Operational Research). 

Many schoolsareexperimentingwithfirstexamination (‘O’ level) work, 
but in senior classes (‘A’ level) the experiments are just developing. 

It would seem sensible to develop and continue a number of the first 
examination (‘O’ level) topics—for example, Boolean Algebra, Matrices 
and Computers all combine in Boolean Matrices (the key to many 
computer circuits). Matrices, Vectors, Numerical Methods, Analogue 
and Digital Computing and Probability and Statistics are part of every 
new course and are to be included. 

It is hoped that these books will help students to appreciate at least 
a few of the current, most pressing applications which must soon be 
part of every mathematician’s repertoire. 

London A. J. S. 

1969 
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Authors' Preface 


This book is intended as an extension and continuation of Matrices 1 
and Matrices 2 (both by G. Matthews) at Sixth Form level. The book 
contains a short revision of ‘O’ level matrix work but is, in fact, com¬ 
pletely self-contained. The first section (Chapters 1-5) is an investigation 
of some aspects of matrix algebra and covers all the known ‘A’ level 
syllabuses. The purpose of this section is to provide the mathematical 
tools which will be of most use later on when dealing with the applica¬ 
tions. The second section (Chapters 6-9) is an attempt to give an insight 
into some of the many and varied applications of matrices. 

It is impossible to include a complete list of applications as this 
would be far too large, and undoubtedly some interesting topics have 
been excluded. However, the selection given has been taken from as 
wide a field as possible and we hope will be of great interest. 
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Matrices and Determinants: 
An Introduction 


U A wmtrix is a rectangular array of m rows and n columns and is 


M 


*11 

*12 

*13 

• * ‘ «Xn 

*21 

*22 

*23 

• * * a 2n 

*31 

*32 

*33 

‘ • «3n 

*ml 

*m2 

*m3 

*mn 


The matrix M above is said to be of order m x n because it has m 
iow» and n columns. Each element a tj or entry of the matrix belongs to 
a given tet,* where i refers to the row and j to the column, e.g. a 2 3 would 
be the dement in row 2, column 3. 

90MB MATRICES 

A m a tri x that is of order n x n is a square matrix, i.e. has the same 
number of rows and columns. 

A matrix that is of order 1 x n is a row vector, e.g. (1 2 3 
A matrix that is of order m x 1 is a column vector . For example, 

1 

2 
3 


m 

* MdeBy the set of real numbers R, but the results are true for the set of complex 
Htnbwi. Later, other sets will be considered, e.g. matrices with Boolean entries. 
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A matrix with all elements zero is called a zero matrix. For example, 

(0 0\ (0 0), /0 0 0\ 

\0 0/’ \0 0 of 


EQUALITY OF MATRICES 

Two matrices are said to be equal if (a) they are of the same order, and 
(b) their corresponding elements are equal. 


ADDITION OF TWO MATRICES 

Two matrices A and B can be added if, and only if, they are of the tune 
order. The sum A + B is obtained by adding corresponding element! 
For example, 

( an «l 2 \ ii b v \ 

Oz\ °22 1 an< i = I ^21 ^22 I 
a 31 a 3 J V>3i b 3 .J 


( Oil + t>U a 12 + *12\ 

a 2 i + b 2 1 a 2 2 + b 2 2 1 
,a 3 i + b 31 a 3S + b 3 J 


MULTIPLICATION OF A MATRIX BY A SCALAR 

If k e R and M is the matrix defined earlier then kM is the matrix 
formed by multiplying every element a w of M by k. For example, 

la b c\ _ 12a 2b 2c\ 

2 \d e f) ~ \2d 2e 2f) 


Exercise 1.1 

1. What is the order of the following matrices? 

(a) ^9 2^ (b) Q (c) (1) 


(d) ip q) 

ig) (4 2 3 1) 
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2. If A, B, C are all matrices of order m X n, and the elements of each 
of them belong to R, show that 

(i) A + B = B + A (the commutative law) 

(ii) (A + B) + C = A + (B + C) (the associative law) 

3. A is an m X n matrix and B is the matrix (—1)^4, find the matrix 
A + B. Use these results to formulate a definition of matrix subtraction. 

4. If A is an m X n matrix and O is the m X n zero matrix, what is 

4 + 0? 

5. Prove that the set of m x n matrices with elements belonging to R 
form a commutative group under addition. 

Multiplication of two matrices 

1.2 If A is a matrix of order m x n, and B a matrix of order p. x q, 
the product AB is only defined if n = p. In this case AB is a matrix of 
order m x q. Matrix multiplication is illustrated by the example.* 



/ (fluba + gl2^2l) 011^12 + #12^22\ 

Z>AB = I Clzibn + #22^21 #31^12 + 22^22 I 

\ #3X^11 + a 32^21 #3X^12 + 032^22/ 

The ringed elements show how one element is formed, for 
(#11 #12)^£ 1:L ^ = (#11^11 + #12^21) 

In this example BA does not exist. 

Exercise 1.2 

1. If A = ( 12 3), 



state which of the following products exist. Work out those which do. 

(a) AB (b) BA (c) AC 

(d) CA (e) BC (f) CB 

* This method of matrix multiplication can be justified by repeated application 
of linear transformations (see p. 91). 
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2. Is matrix multiplication commutative? If not give a counter 
example? 

3. If A is of order m x n and B is of order p X q, what is the con¬ 
dition for both AB and BA to exist? 

4. If A and B are defined as in 1.2 and 

C _ /Cll Cl2 Cl3\ 

\C21 C22 C23/ 

show that (AB)C = A(BC). Can you extend this result to show that all 
matrix multiplication is associative? 

5. By considering three matrices P, Q and R, not all square but of 
suitable orders, investigate whether P(Q + R) = PQ + PR- 

6. Find the conditions that must be imposed on the order of two 
matrices for them to be commutative under multiplication, i.e. 
AB = BA. 

7. Find two matrices A and B such that AB = O but such that neither 
A — O nor B = O. 


The transpose matrix 
1.3 If 



then the transpose is that matrix whose rows are the columns of M. 
It is written as 

(a d g\ 

M'= \b e h 

\c f iJ 

Another example. If 



Exercise 1.3 
1. If 



show that 
2. If 


(AB)' = B'A’ 
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3. A square matrix is said to be stochastic if all the elements are 
non negative and the sum of the elements in each row is 1. Show that 
the product of two 2x2 stochastic matrices is stochastic. 

There is a row vector R = (x y) such that x + y = 1, associated 
with a stochastic matrix 



and satisfying the relation RS = R. Find x and y in terms of a and h. 
Hence find the row vector R associated with the matrix 

(SMP Additional ) 



DETERMINANTS 


1.4 The determinant\A\ = 


a b 
c d 


of the 2 x 2 matrix A = 



is an alternative notation for the expression ad — be. 


a 

d 

g 


b 

e 

h 


c 


f 


, the determinant of a 3 X 3 matrix. 


is defined as 

d e 
g h 

1.5 Note that in this expansion by the elements of the first row, the 
2x2 determinants are obtained by ‘covering up’ the row and column 
in which each first row element appears, i.e. 


e f 
h i 


-b 


d f 
g 1 


+ c 


<1 e f 

-b 

d < 

'- & 

' / 

+ C 

-a — b —£ 
d e f 

jr h i 


8 > 

i i 


8 h J 


In the same way, the determinant 

abed 
e f g h 
i j k l 
m n o p 
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is defined as 


/ 

s 

h 

e 

g 

h 

e 

f 

h 

e 

f 

g 

i 

k 

l 

i 

k 

l 

+ c i 

j 

l ■ 

- d i 

j 

k 

n 

0 

P 

m 

0 

P 

m 

n 

P 

m 

n 

0 


Determinants of higher order square matrices can be defined in a 
similar manner. 
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By considering trapezia such as PACQ, or otherwise, show that the 
area of the triangle ABC with vertices at A(xx , yi), B(x 2 , 72 ), and 
C(x 3 , yz) is given by 

xi yi 1 
i x 2 yz 1 
xz yz 1 

(ft) Find an expression for the area of the triangle as given in the 
figure below 



(c) Show that 

x y 1 
a b 1=0 
c d 1 

is the equation of a straight line passing through two distinct points 
(a, b ) and (c, d.) 

3. If M is the matrix 

b c 

* f 

h i 

show that \M\ = |M'|. 

4. If M is the matrix defined in question 3, and M\ is the matrix 
obtained by interchanging any two rows of M, show that 

\M\ = - \Mi\ 
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5. Use the results of the last two questions to find (without expansion) 
the value of the following determinants, given that 


1 -1 2 
3 4-5 

0 -2 -3 


= -43 


(«) 

0 -2 -3 

(*) 

2 1 -1 


3 4 -5 


-5 3 4 


1 -1 2 


-3 0 -2 


6. (a) If 


[a b 
M —\d e 
\g h 



and Mi is the matrix obtained by multiplying any row of M by 2, 
show that X\M\ = \Mi\. 

(b) Use the result of question 3 to show that if M 2 is found by 
multiplying any column of M by A then A|M| = \M%\. 

(c) If 


1 -1 2 
3 4-5 

0 -2 -3 


= -43 


find, without expansion, the value of the following determinants: 


(i) 


10 

-10 

20 

3 

4 

-5 

0 

-2 

-3 


(iii) 



1 

4 

-2 



(ii) 


6-2 4 

9 4-5 

0 -2 -3 


(d) Show that 


60 

24 

84 


1 

2 

1 

15 

1 

14 

=2x2x3x3x5x7x 

3 

1 

2 

30 

6 

21 


2 

2 

1 


= 2 2 X 3 8 X 5 X 7 
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7, (a) Show that 


a b c 


ci Xd b 4" ^ c + Xf 


d e f 

= 

d e f 

where A e R 

g h i 


g h i 



(b) Use the results of question 4 to show that a constant multiple 
of any one row can be added to any other row without altering the 
value of the determinant. Use the result of question 3 to show that this 
is true also for the columns. 


(c) Evaluate 


0) 

51 

61 

71 

(ii) 

16 

19 

23 


5 

6 

7 


15 

18 

22 


1 

1 

1 


13 

17 

20 


Some properties of a 3x3 determinant 

1.6 The following properties were illustrated in the previous exercise. 

1. The value of the determinant is unaltered by interchanging its rows 
and columns. 

2. Interchanging any two adjacent rows (or columns) changes the 
sign of the determinant. 

3. If all the elements of any row (or column) are multiplied by a 
constant A then the value of the determinant is multiplied by A. 

4. If to the elements of any one row (or column) a multiple of any 
other row (or column) is added, the value of the determinant remains 
unaltered. 


The minors of a 3x3 determinant 

1.7 Consider the determinant 

a b c 
A = d e f 
g h i 

Covering up the row and column in which the element a appears, 

f\ 


the determinant 


is obtained. 


This determinant is known as the minor of a. Now 


A = a 


e f 

h i 


-b 


d f 
g * 


+ c 


d e 
g h 


= a x (the minor of a) — b X (the minor of b) + 

c x (the minor of c) 
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1.8 


To expand by another row (or column), consider for example 


A = a(ei -fh)- b(di -fg) + c(dh - eg) 

= aei — afh — bdi + bfg + cdh — ceg 
= —bdi + cdh + aei — ceg — afh + bfg 
= —d(bi — ch) + e(ai — eg) — f(ah — bg) 




a 

g 



b 

h 


— —d x (minor of d) + e x (minor of e) — 

f X (minor of f) 


This is the expansion of the determinant by the second row. 

To expand by any other row or column is now easy, once the rule 
for the signs has been obtained. By regrouping the terms as before it is 
easily seen that the rule for signs is 

+ - + 

- + - 
+ - + 

For example, the expansion of A by the 3rd column is now 


c 


d 

g 


e 

h 



b 

h 



b 

e 


The cofactors of a 3x3 determinant 

1.9 If A is given as above, 

The cofactor of element a is A = + 
and 

The cofactor of element b is B = — 


/ 

i 

f\ 


= + (the minor of a) 
= — (the minor of b) 


The cofactors of the other elements can be found similarly using the 
rule of signs above. 

The expansion of a determinant can now be obtained for any row or 
column in terms of its cofactors, e.g. 

A = # A + b3 + cC = Z>B + eE + AH 


Factorization of determinants 

1.10 Symmetrical determinants can easily be factorized by the use of 
the Remainder Theorem (if two rows or two columns of a determinant 
are equal then the value of the determinant is zero) and an appeal to 
symmetry. 
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Example 

Factorise 

x y z 

X 2 y2 z 2 

yz zxxy 

(i) If x = y the first two columns are identical. Hence x — y is a 
factor. By symmetry y — z and z — x are also factors. 

(ii) The leading diagonal is x 

J 2 

xy 

which would give on multiplication x 2 y s , i.e. something of 
order 5. 

Now we already have factors of order 3 {(x — y)(y — z)(z — x)} 
and we require a symmetrical factor of order 2. There are only 
two possible—either x 2 + y 2 + z 2 or xy + yz + zx (or a linear 
combination of these). By comparing the possibilities we see that 
the factors must be 

K(x — y)(y — z)(z — x)(xy + yz + zx) where K is a constant 
and K is seen to be one. 

It is not always possible to consider only the leading diagonal as on 
occasion this term may cancel. 

The simple symmetrical functions are for example 

x + y + z; x 2 + y 2 + z 2 ; xy + yz + zx. 

Exercise 1.5 
1. Evaluate 

(i) 1 15 —97 (ii) 27 85 94 

0 84 79 1 -1 0 

0 2 1 4 2 100 

(iii) 25 30 35 (iv) 8 16 24 

15 35 40 24 15 23 

10 25 45 40 1 1 

( V ) _3 _4 2 (vi) 781 397 542 

1 0 -2 115 201 136 

-5 -6 7 551 -5 270 





12 


Matrices: Pure and Applied 


2. (a) If 


a b 



c 


f 



and A, B,. . . denote the co-factors of a, b ,. . . 
Evaluate 


(i) aA + eE + il. 

(ii) aB + dE + gH. 


(b) Find the cofactors of (i) p (ii) q in each of the following 


/ 

m 

n 


a 

0 


p 

<1 

r 

9 

b 

p 

1 

s 

t 

u 


c 

0 

1 


? P 
a b 


3. Find the integral root of 


2-2 2 3 

1 6-2 7 

2 7 11-2 


= 0 


and approximate to the other two roots. 

4. Give reasons from the list of properties of a determinant on 
page 2.4 that enable the following steps to be completed 

Reason 


0 1 JC 


0 

1 

X 

X X 1 

= (x - 1) 

X 

X 

1 

X 2 — X X 2 — 1 x 2 — 1 


X 

x+1 

x+1 



0 

1 

X 


0 

1 

X 

(* — 1) 

X 

X 

1 

= (•*-!) 

X 

0 

1 


X 

x+1 

x+1 


X 

1 

x+1 


0 

1 

X 


0 

1 

X 

X 

0 

1 

= x(x — 1) 

1 

0 

1 

X 

1 

x + 1 


1 

0 

1 


Hence evaluate the determinant. 
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5. Without expanding the determinant in full, show that 


P q r 

q + r r + p p + q 

1 1 1 


= 0 


6. (a) Show that 


a b c 
Ode 
0 0 / 


= adf 


(b) Hence evaluate without expanding 

(0 


(iii) 


1 0 0 

(ii) 

3 0 0 


2 4 0 


0 4 0 


3 5 6 


0 0 5 


0 0a 

(iv) 

0 0 V2 

0 d b 


0 V3 0 

fee 


a/5 0 0 


7. If A 


-cs- 


evaluate the n x n determinant 


\A\ 0 0 

0 \A\ 0 

0 0 \A\ 


0 0 


0 

0 

0 


• Ml 


8. If 


Evaluate 


/ 1 

2 

4 \ 

- 1 

-2 

3 

\ 7 

0 

i/ 


\A\,\5A\-,5\A\-\\A'\-,\A*A'\ 


9. Factorize 


(«) 

i p p* 

(b) 

1 p p* 


1 q q ‘ ! 


1 q q 3 


1 r r 2 


1 r r 3 
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10. Show that 

x y 1 

y 1 X = (x + j + 1 )(xy — x 2 — y 2 + x + y — 1) 
1 x y 

11. (i) Prove that 

a b c 1 a 2 cP 

d 2 ft 2 c 2 = 1 ft 2 6 3 

be ca ab 1 c 2 c 3 

(ii) Factorize 

x 2 y 2 z 2 

(y + z) 2 (z + x) 2 (x + y) 2 

yz zx xy 

12. Evaluate the determinant 



2 1+7 2 
1-7 “I 4 “7 
2 4+y 3 

where y 2 = —1. Explain how, without evaluation, it could have been 
concluded that the value of the determinant was real. (MEI Pure I) 

13. If (o is a complex cube root of unity, prove that 1 + co + co 2 = 0. 
Find in its simplest form, the value of the determinant, 

1 1 1 

1 ft> co 2 

1 ca 2 ft) 3 (MEI Specimen) 

14. Evaluate the determinant 

8 9 10 

A = 4 5 6 

91 92 93 

Hence or otherwise find the area of the triangle ABC where A is 
(91, 95), B is (92, 97), and C is (93, 99). (MEI Specimen) 

15. (i) Prove that 


1 

2x 

x+y 

27 

1 

x + 7 

2z 

2z 

X + z 

y + z 


2 

x + z 
2J 

y + z 

1 


'O - z)(z - X)(x - J>)* [ 2 
.0 + z)(z + x)(x + y)_ 





Matrices and Determinants 


15 


(ii) Prove that the nth-order determinant 

x a a a . . 

ax 0 0 . . 

a 0 x 0 . . 

a 0 0 x . . 


is equal to x n — («— 1 )a 2 x n ~ 2 . (O & C) 

16. (i) Solve the following equation for x given that a 2 1. 

jc 3 a 2 + 1 1 

<P x 2 + 1 1 = 0 
1 x 2 + a 2 1 

(ii) In a determinant of order n 9 every element in the principal 
diagonal is equal to x, and every other element is equal to y. Show that 
the determinant has (x — y) n ~ x as a factor, and find an expression for 
the determinant. (O & C) 

17. (i) Express the determinant 

abed 
d a b c 
c d a b 
b c d a 

as a product* of two (real) linear factors and one (real) quadratic factor, 
(ii) Express the determinant 

x 2 + 2yz z 2 + 2xy y 2 + 2zx 
y 2 + 2zx x 2 + 2yz z 2 + 2xy 
z 2 + 2 xy y 2 + 2 zx x 2 + 2 yz 

as a product of two determinants whose elements are linear in x, y 9 z. 
Hence or otherwise express the determinant as the product of linear 
and quadratic factors. (O & C) 

18. Factorize the determinants 


a 2 (a + l) 2 (a + 2) 2 

(ii) 

x a b c 

* 2 (b + l) 2 (b + 2) 2 


a x c b 

c 2 (c + l) 2 (c + 2) 2 


b c x a 
c b a x 


( O&C) 


* The product of two determinants is found in the same way as for matrices. 
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The inverse matrix 
1.11 The n x n matrix 


1 1 

0 

0 . 

. . <h 


1 

0 . 

. . o\ 

0 

0 

1 . 

. . 0 \ 


6 

0 . 

1 ! i / 


is known as a unit matrix. Simple examples of unit matrices are 



It is easily verified that if A is also n x n 

IA = AI = A, also AI = A if A is m X n; IA = A if A is n X p 


The inverse of a square matrix A is that matrix if it exists, such 
that AA- 1 = A- 1 A = I. 


The adjoint matrix 
1.12 Consider 



and the matrix N, whose elements are the cofactors of the elements of 


\M\ 


[A B 
jV= ID E 
\G H 


Now the product 


la b c\ I A D G\ 
MN' = I d e /}( B E H 
\g h i )\C F I / 


( aA "h hB + cC 
dA + eB + fC 
?A + hB + iC 


flD + bE + cF aG + Z?H + cl' 
dD + eE + /F dG + eH + /I 
g'D + hE + /F + AH + ilj 


/A 0 0\ 

(0 A 0 

\0 0 A J 











Matrices and Determinants 


17 


by the properties of cofactors* where 

n o o\ 

A = \M\ = A 10 1 0 

\0 0 1 


Hence 
and since 


M x 


(H- 


= A/ 
I 


MM- 1 = I 
M~ 1 = ^N' 


N f is known as the adjoint matrix of M and is written ‘adj MS 

Inverses exist only for square matrices since determinants are defined 
only for square matrices. Other ideas of inverses will be found in other 
systems. 

Inverse of a 3x3 matrix by the adjoint method 

1.13 Use is made of the fact that M -1 = adj Af. The method is 

illustrated by a worked example. 

Worked Example 

To find the inverse of the matrix 

a 2 4\ 


-1\ 



The matrix of the cofactors of M is N 

/ —16 6 

n 

Transpose N to find adj M 


Multiply M by adj M 

n 2 4\ 
M.aijM= 0 11 6 
\l 3 2/ 

* See p. 10 and Exercise 1.5 Question 2. 


8 

-2 

- 1 

I 


8 

-6 

l ) 



/ 

-16 

8 

8 \ 


= 

6 

-2 

-6 


\ 

- 1 

-1 

1 / 


-16 

8 

8 \ 

/-8 

0 

6 

-2 

-6 

= 0 

-8 

-1 

-1 

1 / 

\ 0 

0 
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A = \M\ = —8 


M-i = ^ adj M = - i 


As a check 




-16 8 

6 -2 

-1 -1 


MM- 1 = 


/I 2 4W 2 — 1 -1\ 

0 1 6-| i | 

\1 3 2/\ i i -if 

/2 — f + | — 1 + i + J 
— I + I i + I 
\2-t + i -1 + l + i 




Singular matrices 

1.14 A square matrix that has no inverse is said to be singular ; 
otherwise it is said to be non-singular. 

Since adj M it follows that a singular matrix has A = 0. 

Matrices other than square matrices have no inverses since their 
determinants are not defined. 


Exercise 1.6 

1. Use the non-singular matrices 



to demonstrate that (AB)- 1 = B^A- 1 . 
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2. Use the adjoint method to find the inverse (if it exists) of each of 
the following matrices: 


ia) 


e;> 


(c) / 9 15 18\ 
(12 24 21 
\21 42 39/ 




3. Given a matrix A, the unit matrix I and B = I — A, it may be 
assumed under certain conditions that 

A-i = (7 - B)- 1 = I + B + 5 2 + . . . 

This identity leads to an iterative process for calculating the inverse 
of a matrix A. By means of a flow chart show how the partial sums 

C n = I + j^B* 

1 


might be calculated in as few stages as possible. If 



evaluate Ci and C 2 , and also the difference matrix AC 2 — I. 

(MEI Specimen) 


4. The non-singular matrix B has the property BB f = B'B, where 
B ' is the transpose of B. Prove that B'B' 1 = B^B'. Prove also that if 
C = B~ X B\ then CC is the identity matrix. Find B\ BB\ B~ x and C 
when 



{MEI Pure I) 




2 

Elementary Matrices: 
Partitioned Matrices 


2.1 To find the inverse of matrices of order 4x4 and higher, and 
especially when a computer is used, one of the simplest and quickest 
methods is with elementary matrices. Merely as a demonstration, the 
method is investigated for 2 x 2 matrices, but in this case it is long 
and clumsy. 


Exercise 2.1 
1. Let 



where k e C, the set of complex numbers. 

Evaluate the products 

(a) EiA, ( b ) E 2 A, ( c ) E 3 A, (d) E 4 A, (e) E&A 
What happens if the order of multiplication is reversed? 

2. If 

Mi 9- -(“S £) - M n) 


20 
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use the results of question 1 to find (a) LA, ( b ) MB, (c) NC in the cases: 

®L = M=W-(J J) 

?)- M~1 ?)• M* ?) 

"MiJM-U- Mi 4) 

(iv)i "(o !)• M “(o 1). N = (o l) 

<v>mmmm mm) 


The 2x2 elementary matrices 

2.2 The six matrices, 

/ 


-S3 


together with Ei, E%, Ez, E 4 , E 5> defined as in question 1 above, are 
called the 2 x 2 elementary matrices. 


If A is the matrix 


:)* 


then 


(i) 1A = A. 

(ii) EiA interchanges the rows of A. 

(iii) EzA multiplies the top row of A by the number k. 

(iv) £ 3 ^ multiplies the bottom row of A by the number k. 

(v) E 4 A adds k times the bottom row of A onto the top. 

(vi) EzA adds k times the top row of A onto the bottom. 

Similar results are true for the columns of A if the order of multiplica¬ 
tion is reversed, but throughout this chapter only row operations will 
be considered. 


To f I nd the i nverse of a 2 x 2 matrix usi ng elementary matrices 

2.3 By multiplying in front by a chain of elementary matrices, the 
given matrix is reduced to the unit matrix. The method is illustrated 
by a worked example. 
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Worked Example 
Find the inverse of 


U 3 

C,3-(U) 

)/ 

UD 

e 3 -e:) 

U 3 

C 3-C 3 

if 

U 3 

G 3-G 3 


This operation was to produce a 
zero in the bottom left-hand 
element. 

This operation was to produce a 
zero in the top right-hand element. 

This operation was to produce a 1 
in the top left-hand element. 

This reduces the original matrix to 
the unit matrix. 


Note in full what has been performed: 



But matrix multiplication is associative. Hence the brackets may be 
relocated. 


[C 3(!!)« DU 3K .5 - C 3 

But the inverse A' 1 of a matrix A is that matrix such that A~ X A = I. 
Hence if 


—ffi x x DU 
-(-ID 

(-1 -X « 3 



As a check: 
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Exercise 2.2 

Form a chain of elementary matrices to derive the inverse of the follow¬ 
ing matrices: 


e t) 

2 . /-I —3\ 

l-l -2/ 

3. /12 13\ 

4. la b\ 

[ b 

lio 11/ 

5. (i) Prove that 



(.; :r:!)(! -j)(! ?)(: 


Deduce that the rows of a 2 x 2 matrix can be interchanged by opera¬ 
tions which add multiples of one row of the matrix to the other, together 
with operations changing the sign of a row. 

Find a 2 x 2 matrix X such that 

(ii) Do 2 X 2 matrices. A, B, with integer entries, exist such that 

(a) AB = 0,BA 07 

(b) AB = BA = 0, A J=0,B^0, A # B7 

(c) AB = BA # O, A, B # I, A # B7 

In each case, if your answer is ‘yes’, justify it by giving examples of 
suitable A, B. (MET Pure II) 


Better method of finding the inverse of 2x2 matrix 

2.4 The preceding method for finding the inverse of a 2 x 2 matrix 
can be improved upon. In the worked example it was noted that 

id x x du m 

was the inverse of 



[C X X DU 



Hence 
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is also the inverse. But since multiplication is associative, relocating 
the brackets, the inverse is 



Compare this with the multiplication that was applied to the original 
matrix to reduce it to the unit matrix. 


(i °M °M 1)((-3 X .1} 


This means that the inverse of a matrix can be obtained by finding 
the elementary row operations that reduce it to the unit matrix; and 
simultaneously performing these operations on the unit matrix. There 
is no need to find the elementary matrices used; but merely to perform 
the row operations that pre-multiplication by them would achieve. 


2.5 For example, to find the inverse of 



Proceeding with the same row operations as before 

Unit matrix ^ ^ Original matrix Q ^ 

take 3 times the top row from the bottom, R% — 3Ri 



This operation is simultaneously performed on the unit matrix 


U J) 

/ ¥ -f) 

1 

1 

1 

1 

2 0> 

Now take f x Row 2 from Row 1 

(-3 lj 

i l 

.0 4) 

1 (Ri - Pa) 

/ ¥ -h 
1-3 lj 

i i i 

(l O'' 

4J 

| Take J of Row 1 (Pi) 

/¥ -11 
l-i i) 

i i i 

^1 0> 
10 1) 

| Take £ of Row 2 (P 2 ) 
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Now 




has been reduced to 


63 


and simultaneously, using the same row operations. 


n 


becomes 


Hence, using the idea in 2.4, 


/ ¥ -f\ . 
1-1 i) 1 


is the inverse 


a i) 

- G 


Exercise 2.3 


Use the method of row operations above to find the inverse of the 
following matrices: 



2 G 3 

; c -i) 

6 (r given pi ^0 


The 3x3 elementary matrices 

2.6 The ideas of 2 x 2 elementary matrices carry over for 3 X 3 
matrices. 


Exercise 2.4 


1. If 




where Jc e C, the set of complex numbers and E n is given as below, find 
E n A in each case if 


(«) 

(0 

1 

°1 

l CO 

(0 

0 

1 ) 

l (c) 1 


0 

°\ 

Ei = | 

1 

0 

0 

E* = 

0 

1 

0 

£3 = 

0 

0 

1 


[o' 

0 

1 } 

f 1 

b 

0 

0 ) 

■ \ 

10 

1 

0 / 

(d) , 

(k 

0 

0 \ 

l (e) i 

p 

0 

0 \ 

! CO 

/l 

0 

°\ 

Ei=\ 

0 

1 

0 

£5 = 

0 

I 

k 

0 

= 

0 

1 

0 


[0 

0 

V 


[0 

0 

l) 

r 1 

io 

0 

k) 
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(8) 1 


0 

k\ 

1 (h) 1 

n 

0 


1 ® 1 

1 

k 

°\ 

Ei = 

0 

1 

0 

E s = 

0 

1 

k 

£9 = 

° 

1 

0 

1 

lo 

0 

l) 

f ' 

lo 

0 

1 ) 

f ' 

lo 

0 

1 / 

(7) J 

/I 

0 


, (*) 

n 

0 

°) 

i (0 l 

f 1 

0 

°\ 

Eio = j 

0 

1 

o 1 

En - 

k 

1 

0 

Eli = 

° 

1 

0 


\0 

k 

1) 

f 1 

\o 

0 

1) 

f ' 

[k 

0 

1 / 


2. Use the results of question 1 to compute the following products. If 


A = 



find EA when 


(0 , 

(0 1 0^ 

(ii) / 

E= | 

1 0 0 
lo 0 ij 

-( 

(iii) , 

n 0 i) 

(iv) / 

E = 

0 1 0 
lo 0 1 J 

-( 

(v) , 

fl 0 0) 

1 (Yi) / 

E = 

0 i 0 

lo 0 1 ) 

1 - 



2.7 The thirteen matrices. 


/ = 



together with Ei, E %,. . £ 12 , defined as in question 1 above, are 

called the 3 x 3 elementary matrices. If 


then 



(0 IA = A. 

(ii) E X A, E 2 A, E 3 A each interchange two rows of A. 

(iii) E±A, E B A, E e A each multiply one row of A by a scalar k. 

(iv) E 7 A, E e A,. . ., E 12 A each add k times one row of A to another 
row. 

Similar results are shown to be true for the columns of A by reversing 
the order of multiplication. Here only row operations are considered. 
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Echelon form 

2.8 A matrix which has only zeros below its leading diagonal is said 
to be in echelon form . 

An example of a 3 x 3 matrix which has this form is 



In this case the leading diagonal is 

a 

d 

f 

2.9 A 3 x 3 matrix can be reduced to echelon form by pre-multi- 
plication by a chain of elementary matrices. The method is shown by 
the following example. 


Worked Example 

Using a chain of elementary matrices reduce 






/ 3 0 

3 \ 






( 4 2 

6 ) 






\12 18 

6/ 


to echelon form. 









/ 3 

0 

3 \ 





( 4 

2 

6 





\12 

18 

6 / 





/ 



/ i 

0 

°W3 

0 

3 \ / 3 

0 

3\ This subtracts 4 

0 

1 

° 4 

2 

6 = ( 4 

2 

61 times the top row 

\-4 

0 

l/\l2 

18 

6 / \0 

18 

—6/ from the bottom 







Rz - 4J?i 

/ 1 

0 

°\/ 3 

0 

3 \ / 3 

0 

3 \ 


1 


2 

6 =° 

2 


\ o 

0 

1 /V 0 

18 

-6/ \ 0 

18 

-6/ 








/ 1 

0 

0\/ 3 

0 

3 \ / 3 

0 

3\ 

° 

1 

0 0 

2 

2 =° 

2 

s' 

1 

* 

”c7 

\ 0 

-9 

1 /\ 0 

18 

-6/ \ 0 

0 

-24/ 










/ 3 

0 

3\ 





0 

2 

21 is now in echelon form. 



\0 

0 

-24 } 
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To find the inverse of a 3 x 3 matrix using elementary matrices 

2.10 The improved method for 2 X 2 matrices is used again. By 
multiplying a non-singular 3x3 matrix by a chain of elementary 
matrices the matrix can be reduced to the unit matrix. 

If M is a 3 X 3 matrix and 

En ... JS&jEfflM = I 

where E n >. . ., E a are 3 x 3 elementary matrices then 
(En. . . EbEa) = M-1* 

This means that 

(E n . . . EaE b )I = M- 1 

Hence if M is reduced to the unit matrix using a chain of elementary 
row operations, then the same operations applied to the unit matrix, 
simultaneously reduce it to 


Worked Example 
To find the inverse of 


/ 3 0 

4 2 
\12 18 


First it is reduced to echelon form 


Unit 

Matrix 


/ 1 

0 

°\ 

I 0 

1 

0 

\ 0 

0 

1 / 


Original 

Matrix 


/ 3 0 

4 2 
\12 18 



0 

1 

0 



0 

1 

0 

0 

1 

-9 



0 

2 

18 




Elementary 
Row Operation 


R 3 - 4Ri 


Rz — fill 


R 3 — 9R 2 


* The product En . . ■ EbE a is unambiguous since matrix multiplication is associative. 
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The original matrix is now in echelon form 


(:i 

(: 

(: 


0 
1 

£ I 

2 -f 
■i 1 
-i I 

2 -I 
I 1 
i I 



— 2XR3 


Ri - 3Rs 


Rz — 2R 3 


There are now zeros everywhere except down the leading diagonal 

iRi 


(: 




3 

8 


3 



4*2 


The original matrix has been reduced to the unit matrix by a series of 
row operations, and simultaneously the unit matrix has been trans¬ 
formed into the inverse. 

Notice that it is unnecessary to actually find the elementary matrices 
used. 

As a check 


/ 1 

-s 


/3 

0 

3 \ 

-i 


* 

( 4 

2 

6 

\-i 

I 

-*/ 

\12 

18 

6/ 


-( 


2 — 1 + 1 
-1 + i + i 
-1 + f-i 


-I + i 2 - S + i' 

i + i -1 + 
f-l -1 + 


+ i \ 
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Exercise 2.5 


Use elementary matrices to find the inverse of each of the following 
matrices. 



The case of a singular matrix 

2.11 Elementary row operations are used to try to find the inverse of 


Take Rs — f^i 


i?2 — 


Rz — 
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Now the matrix is in echelon form but there is a zero in the leading 
diagonal. It is now impossible to use elementary row operations to 
reduce the matrix to the form 



where none of a, b, c are zero. Hence it is impossible to reduce the matrix 
to the form 



It follows that the matrix 



is singular, and this can easily be verified by considering the determinant. 
Exercise 2.6 

Try using elementary row operations to reduce each of the following 
singular matrices to the unit matrix. 



11 4 3\ 2 

(3 5 9\ 

1 . 1 

1 1 1 

2 4 8 


\5 7 6/ ' 

U 3 7/ 


(5 10 9\ 4. 

fa b c\ 

3 1 

8 9 131 

0 0 d 


\5 5 8/ ' 

i,0 0 ej 

PARTITIONED MATRICES 



2.12 Any matrix may be partitioned by a series of horizontal or 
vertical straight lines into a set of submatrices. For example, 

1 2 3 

4 6 6 

7 8 9 

10 11 12 
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may be partitioned as 



C = (10 11) D = (12) 


Exercise 2.7 
Show that 


1 2 3 


7 8 9, 


4 5 6 + 10 7 4 


11 8 5' 


9 6 3, 


)-( 


A + E 
C + G 


5 + n 

D + Hj 


where (i) 






<N 

II 

)• -0 

1. 

C = (7 

8), 

D = ( 9) 

MS 

OO 

II 

1. 

G = (9 

6), 

H = (3) 

and (ii) 



/4) 

l 

„ /5 6\ 

A = (1), 

B = (2 

3), 

\7J 

1* 

9j 

E- (11), 

F= (8 

5), 

-Ci 

:)• 

-n 


Block addition 

2.13 The above example shows that the block addition of partitioned 
matrices is possible. It is, however, only possible if the original matrices 
are of the same order, and are partitioned in the same way. 
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Consider what happens if an attempt is made to find 

lA + E B +F\ 

V C+G D + HJ 

a= (^ 5 )’ *-Q’ c=(? 8)> d=(9) 

E=( 11), F={ 8 5), G=( 1 °), H=i^ 


if 


and 


Exercise 2.8 
1. Show that 


where 


2. Show that 


where 


and 


l a 

b 

r 

e 

\g 

h 

b 


e 

fh 

b 

c \ 

e 

/ 

h 

il 

1 , 

B = 


1-(S£> 


A = (“c d)’ B= (j)’ C={g hX jD = (0 


E = 


P 

ir 


F — (3) 


3. If X is partitioned as 



i 

A \ 

1 B \ 

m $ 

\ 

C 

1 D ) 


<— 

- > 

< -> 



ni 

«2 
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and Y is partitioned as 


Pl±_ 

JL 

E 

F 


pt $ 

\ 

G 

H 

) 

v 


"4r 


?2 



where mt, Ji{, pu qi are the integral numbers of rows or columns in each 
partitioned matrix, find the condition that each of the product matrices 
exist in the block product 

(AE + BG AF+BH\ 

\CE+ DG CF+ DHj 


What is the condition that XY exists? Is this condition satisfied by the 
conditions above? 

4. The Pauli spin matrices (associated with Elementary Particle 
Physics) are 



Prove that: 


(i) A* = B* = C 2 = /. 

(ii) BC = -CB = jA 
CA = -AC = jB 
AB = -BA = jC. 

The Eddington-Dirac matrices are 



c== (o Where; 2 = -1. 



Show that for all values of p 



and that for all distinct values of p and q 
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Block multiplication 

2.14 The preceding exercise leads to the conclusion that two par¬ 
titioned matrices can be block multiplied in partitioned form provided 
all of the products of the submatrices exist. 


Matrix inversion using partitioned matrices 

2.15 The matrix whose inverse is to be found is partitioned so that 
A and D are square, non-singular matrices. 

Now if I r n ) is the partitioned matrix (where A, B, C and D are 
\c IJ] y\ 

matrices) with an inverse I I then 

( A n = ( T °) 

\c d)\z w) \0 l) 

where I is the unit matrix of the required order, which leads to 


AX+ BZ — I 

(i) 

AY+ BW = O 

(ii) 

CX+ DZ=0 

(iii) 

CY+ DW = I 

(iv) 


From equations (i) and (iii) 


AX = / — BZ 

and -D-'CX = Z 

=> AX=I + BD-'CX 
=> (A - BD~iC)X = I 
=> X=(A- BD-'Q-i 

and so on, the values X, Y, Z and W being 

X= (A- BD-'C)- 1 
Y= —A~ l BW 
Z = — D~ X CX 
W = (D — CA-'B)- 1 


Using these equations it is necessary to perform four inversions. Easier 
equations to work with can be found by combining the equations (i) 
and (iv) with the four similar equations found by considering 
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The resulting equations require the inversion of only two matrices. 
These equations are: 

X= (A- BD~ l C)~ 1 
T= -XBD- 1 
Z = -D-'CX 
W = — ZBD-1 

Worked Example 
Find the inverse of 

c; i i) 

This matrix is partitioned as follows (although this is not the only 
possibility) 


1 _1 

-2 1 

( 7 

1 1 

\ -2 

-2 1 

C 

D 

[(-1) - (-2 

■ «(1 ' 

[(-l)-(0 

«u)r 


= [(- 1 ) - (- 2)]" 1 
= ( I )' 1 
= (D 


( 11 ) 


Y= —1(—2 1) 

= - 1(0 1 ) 

= (0 - 1 ) 


—(! DU® 

-« 
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-(! »(! 1 ) 
-(! IK:!) 

-(! 4) 

Hence the inverse of in this case gives 


2.16 This method is clearly applicable for finding the inverses of much 
larger matrices. 

Worked Example 
Find the inverse of 



Partition this matrix as shown. There are four equations to solve as 
before: 
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— G =D(1 -X ?) 

-e -i) 

*~G DCS X -!) 

-G 3 

Hi 9-6 -)r» -9G >) 
-G ’)-(=: 3 
■6 3 

Hence the required inverse is 

1 - 12-1 
2 - 14-1 
2 - 45-2 
2 - 24-1 


Exercise 2.9 

Use partitioned matrices to find the inverse of each of the following 
matrices: 
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7. /-I 2 4 -1\ 8. /I 2 2 2\ 

I -2 2 4 -1 \ / 1 3 2 2\ 

1 -1 1 2 -1 I I 1 2 3 2 I 

\-4 2 5 -2/ \l 2 2 3/ 


9. /I 1 £\ 10. /4 3 3 3\ 

/ 1 -1 I -i\ /3 3 2 2 1 

\1 1 £-il 13232/ 

\l 1 -* -1/ \3 2 2 3/ 



3 

Linear Equations 


The determinant method (Dr Cramer’s rule) 

3.1 One solution of simultaneous linear equations is provided by 
making use of determinants. 

From Exercise 1.5, question 2, it follows that if 

a b c 
A = d e f 
g h i 

then the sum of products of the elements of any row with their corres¬ 
ponding cofactors is the value of the determinant, e.g. 

aA + dD + gG = A 

but the sum of these elements with cofactors from another row is 
zero, e.g. 

aB + dE + gH = 0 

Now consider the simultaneous equations 

r ax + by + cz = p ® 

dx + ey + fz = q (D 

,gx + hy + iz =r (§) 

where p, q, r are not all zero. 

If A, B, C,. . . are the co-factors of a, b, c 9 . . . in A above, then 
(A X ®) + (D x ©) + (G X ®) 

gives 

(aA + dD + gG)x + (bA + eD + hG)y + (cA + /D + /G)z 

= pA + qD + rG 

Now (aA + dD + gG) = A, and the coefficients of y and z are both 
zero. Hence 

x = [jpA + ^ + rG] (provided A ^ 0) 
p h c L 

= q « / = a’ say 

r h i 

A 
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ape 

M 

a b p 

Similarly y = 

d q f 
g r i 


d e q 
g h r 


A A 


Notice that the column I q 1 replaces the column of the coefficients 


in A of the required variable. 

The result is easily modified for two equations in two unknowns. 


Worked Example 
To solve 2x + y = 41 
3x + 5y = 13J 


4 1 
13 5 

-1-U v _ 

2 4 

3 13 

2 1 

“7" 1 ’ y ~ 

2 

1 

3 5 


3 

5 



Exercise 3.1 

Use the determinant method to solve the simultaneous equations: 


1. x + y = 3 


2x + 3y 


:9 


3. 2x + 3y — z = n 
x — 4y + 2z = —6 
5x — 2y + 2z = —8j 


2. x + y + 2z = 8' 

2x — y + 3z = 15 
3x + 2y — z = —3, 
4. jc + y — 3z = —51 
x — 3 y + z = 11 
x — y — z— 3 


Some special cases 

3.2 If the above method is employed to find the solution of 
ax + by + cz =p 
dz + ey+fz = q 
gx + hy+iz = r, 

and it is found that the determinants of the coefficients 


A 


d e f 
g h i 


0 


then a unique solution does not exist. In this case no solution of the 
equations will exist, unless the other determinants L, M, N are also 
zero. If, for example, it is found that L is zero, any value can be assigned 
to x, say a, and the problem remaining is to solve the equation for y and 
z in terms of a. A more detailed analysis will be considered later. 
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Exercise 3.2 

Use the above method to find the solution, if it exists, of the equations: 


1. x+ y + z = r' 
x + 2y + z = s 
6x + 5y + 4z = t, 


(i) when r = 3, s 

(ii) when r — 2, s 


6, t = 8, 
6, t = 8. 


2. 15x + 5y — lOz = 20' 

— 9x —3 y + 6z = —12 
lx + my + nz = 16, 


(i) when l = m — n — 

(ii) when l = 12, m = 4, n = —8. 


MATRIX AND GEOMETRIC METHODS 


Two equations in two unknowns 

3.3 Consider the equations 

{ ax + by = p (not both a and b zero) 
cx + dy = q (not both c and d zero) 

These can be written in matrix form as 



and solved by multiplying both sides of the equation by the inverse of 



where A = ad — be # 0. 



d P ~ bq ^ ag-cp 

ad — be ? ad — be 


3.4 The solution above is correct in the general case, but it does not 
give a solution if the determinant of the coefficients, A = ad — be = 0. 
To see what happens in this case an appeal is made to geometry. 
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In general, the equations 


ax + by = p\ 
cx + dy = q] 

can be represented by two intersecting straight lines in a plane. 



The coordinates of the point of intersection of the two lines give the 
solution of the two algebraic equations 

ax + by—p\ 
cx + dy = qj 

Worked Example 
Solve the equations 

The equations can be put in the form 



Mulitiplying through by the inverse of the matrix of the coefficients 



f2x+ y = 4 
\3x + 5y = 13 
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Two special cases 

3.5 (1) When the two lines are parallel. 



In this case the two lines do not intersect, and so there is no solution 
of the equations 

ax + by = p\ 
cx -J- dy — q) 

The equations are said to be inconsistent. 

Since the lines are parallel the equations must be in the form 

ax + by =p\ 
kax + kby = qj 


In this case it is seen that 


A = 


a b 
ka kb 


= 0 


Worked Example 
Solve the equations 

6x + 3 y = 71 
4x + 2y = lj 


Since 


6 3 
4 2 


= 0 


the equations cannot be solved as they are inconsistent. 

It is seen graphically that the equations represent two parallel lines 
since they both have a gradient of —2. 



Linear Equations 


45 


Hence there is no solution. It should be noted that § times the first 
equation gives 

4x + 2 y = V 

which is inconsistent with the second 

4x + 2y = 1 



4x + 2y = 1 


3.6 (2) When the two lines are coincident. This is a special case of (1). 

When the equations are in the form 

ax + by = p) 
kax + kby = kp) 


Again, A = 0, but in this case any point on the line will satisfy 
both the equations. There are an infinite number of solutions. 
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Worked Example 
Solve the equations 

6x — y = 5) 

— I2x + 2y = — 10 / 

It is seen that 



and so the equations cannot be solved by the matrix method. 

It is seen that if the first equation is multiplied through by —2, 
6x — y = 5 becomes — \2x + 2y = — 10 , the second equation. One 
equation is said to be linearly dependent upon the other. Hence the two 
equations are consistent and represent the same straight line. There are 
then an infinity of solutions. If x is given the arbitrary value a then 
y = 6 a — 5. The solution is the set of points (a, 6 a — 5) for all real 
values of a. 

Exercise 3.3 

1. Find the solution set of the following pairs of simultaneous 
equations: 

(a)x+j = 3J (b) 5x-3y = 10| (c) 21x - 2Ay = 61 

x — y = 1/ 3x — 5y = 15/ 18x — 16y = 9) 

(d) Tlx - 24 y = 9| ( e ) -2x + y = 3| (f) lx + = 111 

18x — 16 y = 6) x — 2y — 3) 6x + 7y = 13) 

2. Is (1, 4) an element of the solution set of the equations 

2 x-y= -2' 

2x+y= 6 ? 

+ y = 5, 

Sketch the graph of each equation. Show how it illustrates the answer. 

3. Find the solution set of the equations 

x - y = r 

2x — y = 3 

Ax —2y = 6 . 

4. Investigate the conditions under which there is a solution set to 
the equation 

x+ y = T 
x— 7 = 0 
ax + by = c. 
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Illustrate your answer graphically in three different cases. 

5. Show that any equation of the form 

Max + by —p) + p(cx + dy — q) = 0 represents a line 
which passes through the point of intersection of the lines 

ax + by =p\ 

cx + dy = q} 9 r 

6 . Compute the product 

l-2~ 2 1 - A AWl 2V 

\ 3 — 2p —2p — 1 2p) 13 41 

Use your solution to solve the equations 

x + 2y= 3) 

3x + 4y = 7 
4x + 6y = 10, 

7. By considering the possible geometrical configurations, or other¬ 
wise, investigate the conditions under which the equations 

ax + by = p y 
cx + dy = q 
ex + fy=r t 

have (a) a single solution, ( b ) an infinite number of solutions. 


Three equations in two unknowns 

3.7 Consider the three equations 

ax + by = p 
cx + dy = q 
ex + fy= r 


Solving the last two by the determinant method 


q d 


c q 

r f 


e r 

c d 

> y = 

c d 

e f 


e f 


provided 


c 

e 
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These values of x and y satisfy the first equation if 


+ b 


c q 
e r 


= P 


=> a 


d q 


c ? 


c d 

r 

— b 

e r 

+p 

e f 


If 


b p 
d q 

f r 


= 0 


= 0 


then the last two equations are inconsistent, unless they are the same 
equation, and so the given three equations have no solution. 

c d 
, =0 
« / 

and the equations are the same, then either of the last two equations 
can be solved with the first one. 


If 


Exercise 3.4 


Use the determinants to find the solution set of the following equations. 


2. 2x + 3y = -1 
3x — 2y = —5 
lx — 2y — 9 t 

4, 


1. fx + y = 5 
2x + 3y = 13 
[lx- y = 11 

3. llx + 4 y = -38 
2x + 4y= -20 
3 x + 6y = — 30j 

5. lx + 3y = 4 1 ! 6. 

lOx — 2y = 6 

— 15x + 3 y = —9j 

7. ax + by = p 
cx + dy = q 

(a + Xc)x + (b + Xd)y = r 


32x — 

24 y = 

16 1 

—8x + 

6y = 

- 4 

3x — 


ij 

2x + 

3j=- 

- 5) 

—3x — 

I y = ■ 

-¥ 

6x + 

18 y = 

3oJ 


where 


a b 
c d 


*0 


in the cases (i) r ^ p + Xq (ii) r = p + kq 
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Three equations in three unknowns 

3.8 Consider the equations 

ax + by + cz — p 
dx + ey + fz = q 
gx + hy + iz = r. 

These can be written in the form 


la b c\ lx\ 

K I W 



Writing this as 
then 


i.e. 


Ax = p 
A~ x Ax = A- 1 }* 
=> /x = A -1 p 
=> x = A-*p 



In general, this will give the unique solution of the three simultaneous 
equations. The difficulties arise if the matrix A is singular. 

To examine the difficulties in this case, i.e. when A is singular 


=* A = 


a b c 
d e f 
g h i 


= 0 


an appeal is again made to geometry. 


The equation ax + by + cz = p 

3.9 Just as the set of ordered pairs (x, y), can be put in one—one 
correspondence with the points of a plane, so the set of ordered triplets 
(x, y, z ) can be put in one—one correspondence with the points of 
three-dimensional space. 

Just as the equation ax + by = p corresponds to a straight line in 
the plane, so the equation 

ax + by + cz = p 

corresponds to a plane in three-dimensional space. 
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The intersection of three planes in space 

3.10 To solve the three algebraic equations 

ax + by + cz = p\ 
dx+ ey + fz = q\ 
gx + hy + iz = r) 

the intersection of the three corresponding planes is considered. 
In general two planes meet in a straight hne 



There are three lines of intersection of the planes taken in pairs. 
These three lines meet in a single point, the point of intersection of the 
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three planes. The coordinates of this point give the solution to the 
original three simultaneous equations. 

The set of coordinates of the point illustrated above is the one given 
by the matrix solution to the three equations. Several special cases can 
occur and these are individually examined. 

3.11 (a) The three planes could intersect like the pages of a book. 



In this case the third plane passed through the line of intersection 
of the other two. The three equations then have the form 

ax + by + cz = p 

dx + cy +fz = q 

(Aa + fid)x + (Ab + fie)y + (Ac + fif)z = (Ap + fiq) 
where A, p are constants. 

Notice that the third equation is a sum of linear multiples of the 
first two. Hence we say that the third equation is linearly dependent 
upon the other two and the equations are consistent. There is an 
infini te set of solutions corresponding to the points of the line of 
intersection. Notice in this case the determinant of the coefficients. 


A = 


a b c 

d e f 

Aa + fid Xb + fie Ac + ftf 


= 0 


Worked Example 


It is noted that 

1 1 1 

2-13 
4 1 5 


x+y+ z= y 
2x — y + 3z = 4 
4x + y + 5z = 10j 


(-5 - 3) - (10 - 12) + (2 + 4) = 0 
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It is also noted that the third equation can be obtained by adding linear 
multiples of the first two. In fact twice the first equation plus the second 
produces the third. This means that the third equation is superfluous— 
it is dependent on the other two. 

To solve the first two equations, let z have the arbitrary value a, then 

( 7 -4a 
x — 3 

2 + a 

y 


Hence the general solution is the infinite set of points 


r- 


— 4a 2 + a \ 

— 3 ’ a ) 


3.12 ( b ) The three planes could intersect as the rectangular sides of 

a triangular prism. 



three parallel lines 


In this case there is no common point of intersection of the three 
planes and hence no solution set. Note that the line of intersection of 
any two planes is parallel to the third plane. 

The three equations in this case will be of the form 

ax + by + cz = p' 
dx + ey + /z = q 
(Xa + fxd)x + (Xb + pd)y + (Xc + pf)z = r 

where r ^ Xp + pq. 
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Hence we say that the equations are linearly independent and since 
there is no solution they are inconsistent. 

If this last constraint is removed, the planes meet in a line as in (a) 
above. The determinant of the coefficients is as in (a). Hence again 

A = 0 


Worked Example 

Find the solution set of the equations 


It is noted that 


x + y + z = 3 
2x — y + 3z = 4| 
4 x + y + 5z = 9) 




It is also noted that twice the first equation plus the second gives 
4x + y + 5z = 10, which is inconsistent with the third equation. 
Hence the solution set is empty. 

3.13 (c) Two of the planes may be parallel 


parallel planes 
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Again there is no common point of intersection of the three plans, 
and hence no solution set. The three equations are of the form 

( ax + by + cz = p 
kax + kby + kcz = q 
gx + hy + iz = r 

where q ^ kp . 

The determinant of the coefficients is 


a 

b 

c \ \ 

a 

b 

c 

ka 

kb 

II 

a 

b 

c = 0 

g 

h 

il | 

g 

h 

i 


The equations are linearly independent and inconsistent. 

Worked Example 

Find the solution set of the equations 

2x ~ 3 y + z — 4' 

4x — 6y + 2z = 5 
x - y - z = 1 , 

The determinant of the coefficients is 

2 -3 1 

A= 4 -6 2=0 

1 -1 -1 

Note that twice the first equation gives 

4x — 6y + 2z = 8 

which is inconsistent with the second equation. Hence the corresponding 
planes are parallel and the solution set is empty. 

3.14 id) If in (c) the case is considered where q = kp the three 
equations become 

ax + by + cz = p 
kax + kby + kcz = kp with k ^ 0 
gx+ hy + iz = r t 

In this case the second equation is identical with the first (one is 
linearly dependent upon the other). The determinant of the coefficients 
is exactly as in (c) and is zero. The two corresponding planes are identical. 
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In general the third plane will intersect these two coincident planes in 
a straight line, and the coordinates of the set of points of this line will 
give the solution set of the three equations. 



Worked Example 

Find the solution set of the equations 

2x — 3y + z = 4] 

4x — 6y + 2z = 8 > 

* - y - z = l > 

The determinant of the coefficients is again zero as in (c) above. The 
second equation is twice the first, and so the corresponding planes 
are identical. 

To solve the first and third equation, let x take the value a. Hence 

—3 y + z = 4 — 2al 
_ y _ x = l — «/ 

3a - 5 
-T-—> 

la. — 9 

2 ~ 4 

The solution is the infinite set 



where a e R. 
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3.15 (e) All three planes can be parallel 



The three equations have the form 
ax + by + cz = p\ 

Xax + Xby + Xcz = q\p ^ Xq ^ pr ; X 9 p ^ 0 
pax + pby + pcz = rj 

Since there is no common point of intersection of the planes, the 
solution set of the three equations is empty. Again it is noted that 

a b c 

A = Xa Xb Xc — 0 
pa pb pc 

=> The equations are linearly independent and inconsistent. 

Worked Example 

Find the solution set of the equations 

x — 2y + 3z = 4 y 
2x — 4y + 6z = 5 

—3x + 6y — 9z = —-1, 

It is noted that 

1-2 3 

A = 2-4 6=0 

-3 6 -9 
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Also twice the first equation gives 

2x — 4y + = 8 

which is inconsistent with the second; minus three times the first gives 
—3x + 6y — 9z = —12 

which is inconsistent with the third equation. The three corresponding 
planes are parallel and the solution set is empty. 

3.16 (/) In ( e ) two of the planes may be identical and the third 

parallel to these. 

parallel planes 



The three equations are as in ( e ) but also q = Xp. There is no common 
point of the three planes and hence the solution set is empty. As before 
A = 0 . 

Worked Example 

Find the solution set of the equations 

2x — y+ z = 3] 

6x — 3y + 3z = 9 J 
4x — 2y + 2z = 5 J 

2 -1 1 

A = 6 -3 3 =0 
4-2 2 

Three times the first equation gives the second, and hence the two 
corresponding planes are identical. Twice the first equation gives 

4x — 2y + 2z = 6 
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which is inconsistent with the third equation. The first two planes are 
identical and the third is parallel to these. The solution set is empty. 

3.17 (g) The equations could be as in (/) with the further condition 

that r — iip. The three planes would then be coincident, and their 
corresponding equations identical. The equations are linearly dependent 
and consistent. 

three coincident planes 



As before A = 0. 

Since the planes are coincident, the coordinates of any point of the 
plane would satisfy the three equations. 


Worked Example 

Find the solution set of the equations 


3x — y + 2z = 4\ 
6x — 2y + 4z = 81 
9x — 3 y + 6z = 12) 


A = 


3-12 
6-2 4 
9-3 6 


= 0 


Note that multiplying the first equation by two produces the second, 
and multiplying the first by three produces the third. The three equations 
are identical, and hence their corresponding planes are coincident. The 
solution set is the infinite set of points lying in the plane. Let 

x = a and y = 
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The solution set is then 

(^.^) 

where xeR, fleR. 

It is noted that all these special cases arise when the determinant of 
the coefficients, A = 0 . 


Exercise 3.5 


Solve the following simultaneous equations 


1. x — 3 y + z = 2] 

x + 2y — z = — 11 
2x+ y + 2z = 6) 

3. 2x — 3y + z = 3' 
x + 5y — z = 3 
lx + 9y — z = 15, 


2 . x + y + z = 61 
2 x=y\ 
x + y = z) 

4. 2x — 3y + z = 31 
x + 5 y — z = 31 
6 x — 9y + 3z = 9) 


5. x + ly + 2z = 4' 
x — 7y + 2z = 2 
lx + z = 6j 

7. 9x - 10 y + z= 11 
3x+ y-z=- 1| 

lx- y + z= 1J 

9. 3x + 2 y — 4z = 71 
2x — 3 y + 5z = 2| 
14x + 31/ - 57z = 46) 


6. lx — y — z = 64' 

2x + 5y- z = 59 
x — 3y — 2z = 115, 

8 . 18x + 3j/+ z= 49' 
x - ly + 15z = -54 
2x — 4y + 13z = -25, 

10 . 18x — ly + 5z = 4 ' 
2x+ ly — 3z= 7 
16x — 84j + 40z = 62, 


11. Multiply together the two matrices 



Hence, or otherwise, solve the simultaneous equations 

3x — y + 5z = 32 
5x — ly + 3z = —8 

x —lly + 7z= 20 (SMP Additional) 
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Exercise 3.6 Miscellaneous 

1. Use any suitable method to solve the equations. 


(a) 10* — 4y + 8z = —10^ 
3* + 2 y + z = 11 

5x — 2y + 4z = 5j 

( c ) x — ly + 9z = 26^j 
9x + 8 y — z = — 11 
lx — 8 j + 9z = 33; 

(e) 15* — 10^ + 20z = 30' 
x+ y + z = 0 
21* - 14y + 28z = 42 

(g) * + 2y — 3z — 4 = 0' 

2x — y + z — 5 = 0 

3* + 2 y + z — 14 = 0 > 


( b ) 15* + 9/ - 3z = 41 

10*+ 6 j — 2z = — 3) 
35* + 21 y — lz = 21J 

(rf) 2* — j + 7z = 4^ 

6* + llj — 41z = —4 
3* + 2j/ — 5z = 2 

(/) 5*+ 14^ -23z= 21 

* + 3 y — 6z = 91 

* + ly — 34 z = I 8 OJ 

(/z) 3* — 3j + 3z 

= 6* + 3j/ + 8z 
= -4 


2. Given the matrix 
A = 

and the matrix 

B = 


/ 1-2 3 

I 0-2 0 

\—0*5 01 

/ 0-2 - 6-2 
10-58 3-02 

\002 -1*62 



Find the matrix AB. Hence, or otherwise, solve 

0-2* - 6-2 y — 6z = 12' 

0-58* + 3*02j + 2*6z = 45 

0-02* — l*62j — 0*6z = 78; (ME1 Specimen) 

3 . If two planes II 1 and II 2 have equations 

ITi = ax + by + cz = p 
n 2 = dx + ey + fz = q 

show that Alii + ^n 2 is a plane passing through the line of intersection 
of IIx and n 2 . 

4. (i) A = Find the most general form of the matrix B such 

that AB = BA and prove that all such matrices B are of the form 
oc/ + ft A where a and p are scalars. 
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(«) 



n 

11 Calculate (i) det A (ii) A~K 
4 -1/ 


Hence, or otherwise, solve the simultaneous equations: 


x+ y + z = 6 '| 
2x — y + z — 3 > 
x + 4y — z = 6) 


5. Find the inverse of 
M 


-n :• :i) 



Hence, or otherwise, solve the equations 


where 


6 . Find the Inverse Matrix of 



and 



(MEI Specimen ) 


(MEI Specimen) 


Hence solve the simultaneous equations, 

2x + 3 y + z = 1\ 
x — y — 2z — 1) 

3x + 2 y + 3z = 4) (MEI Specimen) 

7. Find the general solution of the system of equations 

x + y — az =: b 
3x — 2 y — z = 1 
4x — 3y— z = 2 

in each of the three cases (i) a = 1, b = 9; (ii) a — 2, 6 = —3; 
(iii) a = 2, b = 0. {MEI Specimen) 
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8 . Find the value of a such that the three lines 

2x + 3y + 4 = 0 
x — ay — 7 = 0 
3x+ y — l = 0 

are concurrent. 

9. Find the inverse of the matrix 


a 3 2 \ 


5 2 7 
,1 2 1 , 


(MEI Specimen) 


What is the inverse of the matrix 

/2 10 2 \ 

6 4 4 ? 

\4 14 2/ 

Solve the simultaneous equations 

x + 3y + 2z = 1 

5x + 2y + 7z = — 2 

x + 2y + z = 2 (MEI Specimen) 

10 . If 

12 3 6\ 

M=[6 2 —31, 

\3 -6 2/ 

form the product MM' and show that 

Without multiplying out, state the product M'M, giving reasons for 
your answer. 

Hence, or otherwise, find the solution of the equations 

2x 1 + 3x 2 + 6 x 3 = 1 
6 x x + 2x a — 3x a = 1 

3x x — 6x 2 + 2x a = 2 (MEI Mathematics I) 

11. (a) Evaluate the product 

3 -3 1W1 1 1 

-3 5 -2 1 2 3 

1-2 1/ \l 3 6 



Linear Equations 


63 


Hence or otherwise solve the simultaneous equations 
x+ y + z= 2 
x + 2y + 3z = 6 

x + 3y + 6z = 13 (SMP Additional) 

HOMOGENEOUS EQUATIONS 
Two equations in two unknowns 

3.18 Consider the two equations 

ax + by = 01 
cx + dy = Oj 

If the corresponding lines are considered, they both pass through the 
origin, and hence the equations have the trivial solution 

x = 0 , y = 0 

Other solutions will also exist if the two equations represent the same 
line. In this case the gradients are the same. 

a __ c 
b = d 

=> ad — be = 0 
a b\ 

^ = 0 
c d | 

which is the condition for a special case as in 3 . 6 . 

Three equations in three unknowns 

3.19 Consider the equations 

ax + by + cz = 0 
dx + ey + fz = 0 
gx + hy + iz = 0 

If the corresponding planes are considered, then clearly each plahe 
passes through the origin. 

The three equations therefore always have the trivial solution 
* = 0 , y = 0, z = 0 

Other solutions are only possible if one of three special cases apply, 
i.e. when 

a b c 

A= d e / = 0 as in 3.14 
g h i 



64 


Matrices: Pure and Applied 


The three special cases 

3.20 (a) The three planes may meet as the pages of a book in a line 

through the origin. 



line of intersection 


In this case, as with non-homogeneous equations. 


A = 


a b 
d e 
g h 


c 


f 


= 0 


The coordinates of any point on the line of intersection of the three 
planes will satisfy the three equations. 


3.21 (b) Two planes may coincident and the third meet these in a line 

through the origin. 



In this case again 


A = 0 
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The coordinates of any point on the line of intersection of the three 
planes will satisfy the three equations. 

3.22 (c) The three planes may all be coincident, i.e. the three equa¬ 

tions represent a single plane through the origin. Once again A = 0. 
The coordinates of any point in this plane will satisfy the three equations. 



3.23 In general , the equations 

ax + by + cz = 0 
dx + ey + fz = 0 
gx + hy + iz = 0 

represent three planes through the origin. They always have the trivial 
solution x = 0 , y = 0 , z = 0 . 

Now non-trivial solutions can exist only if 

a b c 

A — d e / = 0 

g h i 

and in this case there are three possible geometrical configurations. 
Exercise 3.7 

1 . Determine the values of A for which the equations 

(A - 2)x - 2 :y-z = (f| 

* - (A - 3)y + z - 0 
-x — 2y + (A - 2)z = Oj 

have a solution other than the trivial one. 

2. Express one of the equations 

2x + y + z = 0 
2x + 2 y + 3z = 0 
6x + + 7z = 0 
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as a linear combination of the other two, and find the general solution 
of the equations. 

Also show that if any one of the coefficients is altered, the resulting 
equations will have a non-trivial solution. (University of Leicester) 

3. Prove that the values of k for which the three simultaneous 
equations 

*1 + * 3 = kx ± 

—2x x + 2x 2 = kx 2 

— + *2 “ *3 = kX 3 

have solutions other than xi = X 2 = xz = 0 satisfy the cubic equation 
- 2k 2 + 2 = 0. 

Prove that one root of this cubic equation lies between —1 and 0 
and find this root correct to two decimal places by using an appropriate 
iterative method. (MEI Mathematics I) 


Choleski's method 

3.24 This is one of the quickest methods of solving three simultaneous 
equations in three unknowns. The method is demonstrated with a 
worked example. 

Consider the equations written in matrix form as 



The solution of such equations by the previously explained matrix 
method* involves finding A~\ the inverse of A. Choleski’s method does 
not use an inverse. Instead use is made of the fact that an equation 
such as 



where the matrix of the coefficients has zeros above the leading diagonal, 
is easy to solve. In this case multiplying out gives 

—3 p = 8 => p = —t 

9p + q s= —21 => q = 3 

—5/? — 17 => r = —14 


* See 3.8. 
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3.25 The method involves looking for two matrices L and U so that 
LU = A, and where 

Z, is a matrix with zeros above the leading diagonal, as above, and 
U is a matrix with zeros below the leading diagonal, which is 
arranged to be {1, 1, 1}. 

Now to find L and U, it is noted that 

/-3 0 0\/l u v\ 1—3 2 1\ 

9 / 0 0 1 w) = ( 9 -5 -21 

\-5 m nj\ 0 0 1/ \-5 3 1/ 

— 3 u = 2 =► u = — f 

— 3v = 1 => v = — J 

9u + l = -5 => / = 1 

9v + Iw = —2 => w = 1 

5u + m = 3 => m = —J 

5v + mw + n = 1 => u = — J 

L U = A 

/-3 0 0\/l -f -i\ /—3 2 1\ 

9 1 0(0 1 1=9-5 -2) 

\-5 -i -j/\0 0 1/ \—5 3 1/ 

Now the original matrix equation was 

Ax = b 
=> Z.t7x = b 

But if Ux = p, then 
But from (ii) 

P 


Lp = b 



Multiplying out 


Hence 
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Multiplying out 

z = —14 

y + z = 3 => j = 17 

* — f y — \z = — I =► x — 4 

Hence the solution set is 



Exercise 3.8 

Use Choleski’s Method to solve the following equations: 

1. x — 2y + z = — U 2. 2x + y — z = 0^ 

7 - z = 1 x- 7 + z = 6 

3x + y — 2z = 4) x + 2y + z = 3; 


Some special cases 

3.26 The special cases are demonstrated by the following examples: 
Example l 

x + y + z = 3 ' 

2x — y + 3z = 4 
4x + y + 5z = 10, 



x 




To find L and U 

L U = A 

(1 0 0W1 u v\ (l 1 1\ 

(2 / 01(0 1 w = 2 -1 3 

\4 m n }\0 0 1/ \4 1 5/ 


u = 1 
t>=l 

2w + / = -l=> l = -3 

2v + /w = 3 => w = —£ 

4m + /w = 1 =>* m = —3 

40 + wm + n = 5 => n = 0 


Hence 
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Hence 


Now if 


then 


L U = A 


0 0\/l 1 1\ /I 1 1\ 

-3 0 0 1 - 2 -1 3 

-3 0/ \0 0 1/ \4 1 5/ 


L p = b 
/I 0 0\ fp\ / 3\ 

2 -3 0 U = 4 

\4 -3 0 ]\r] \10i/ 

P = 3 

2p — 3 q = 4 => q = f 
4 /? — 3 # = 10 
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which is satisfied by the above values of p and q, r is undetermined. 
Hence 

p-M 


Hence 


Ux = p 



x+y + z = 3=> x = 3 


2 + r 


7 — 4r 


r = 


Hence the solution set is the infinite set of points 


/7 — 4r 2 + r 
[ 3 ’ 3 ’ 


') 


for all real values of r.* 


* cf. p. 51. 
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Example 2 

2* - 3 y + z = 4\ 

4x — 67 + 2z = 51 

X _ y- z=l) 

A x = b 

(2 -3 1\ (x\ /4\ 

4 -6 2 (j =15 

\l -1 -1 )\z] \l/ 

L U A 

12 0 0\/l u v\ 12 -3 1\ 

(4 / 0 (0 1 w = |4 —6 2) 

\l m n )\0 0 1 / \l -1 - 1 / 

Hence 2 m = —3 => m == — f 

2u = 1 => v = J 

4m + / = —6 => l = 0 

4v + Iw =* 2 which is satisfied by any value of w 

u + m = —1 => 7w == i 
v + wm + n = —1 => n = —£ — Jw 



Hence 2p = 4 => p = 2 

4/? ss 5 => p = I 


which is inconsistent. Hence there is no solution. 

Exercise 3.9 

Use Choleski’s method to find the solution, if it exists, of the following 
equations. 

1. x+3y + 2z= n 2. 2x+ y - 3z = 4^ 

5x + 2y + 7z = —2 3x — 4y + 5z = 2 

x + 2j + z = 2j 5x — 3y + 2z = 6, 
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3. 6x- 2 y+ 4z = 8^ 4. 4* - 6y + 2z = 5 1 

-9* + 3^ - 6z = —12 > -6x + 9^ - 3z = 15 

36jc - 12j + 24z = 48J x + y + z = 2, 

Classification of the solutions of linear equations* 

3.11 Definitions. The rank of a matrix is the ordert of its largest square 
‘sub’-matrix whose determinant is non-zero. The rank of a null matrix 
is zero. A set of linear equations with one or more solutions is called a 
consistent set. If there is no solution the equations are inconsistent. A 
set of linear equations Ax = b is consistent if and only if the rank of 
the augmented matrix (containing A with the additional column b and 
written Ajb is equal to the rank of A. This common value of the rank is 
called the rank of the set r. 

A consistent set of linear equations Ax = b is n unknowns has a 
unique solution if and only if r = n. Given a set of linear equations 
Ax = b in n unknowns, then 

either (1) they are inconsistent and rank A < rank Aj b 
or (2) they are consistent and rank A = rank Ajb *» r 
In this case we have 

(a) a unique solution if r = n 
or (b) an infinite number of solutions if r < n. 

Example 1 

Consider the set of linear equations 



Now the rank of A must be 3 or less, and we only need to find one 
non-zero determinant of order 3 to show that the rank of A is 3. 
Now \A\ = 10. Hence the rank of A — rank of A/b = 3, and the set 
has a unique solution. 

* This is a brief note only and is a guide to further work; it is in no way a 
justification. 

t The order of a square matrix or determinant is not usually given as 2 x 2 or 3 x 3 
but left as 2 or 3. 



72 


Matrices: Pure and Applied 


Example 2 

Consider the set of linear equations 



12 1 -3\ 12 1 —3 4\ 

i.e. ^4 — 13 —4 5 and Afb = 3 -4 5 2 

\5 -3 2/ \5 -3 2 6/ 


Now |4| — 0, hence the rank of ^4 is less than 3. 

2 1 

Consider - , = —11. Hence the rank of A = 2. 

3 —4 

Consider the 3x3 submatrices of A/b, i.e. 


■2 1 -3\ / 1 -3 4\ J2 —3 4\ 12 1 4\ 

3 -4 5 , -4 5 2 , 3 5 2 , 3 -4 2 

,5 -3 2/ \—3 2 6/ \5 2 6/ \5 -3 6/ 


Each of these have determinant zero. 


Hence rank A = rank A /b = 2. 

Since r = 2 and « = 3 we have an infinite number of solutions. 


Example 3 


x — 2y + 2z == 3 
2x — 4j + 4z = 1 
3x — 3j — 3z = 4 



Ml 

Consider 


0, but 


1 —2 
1 -4 


—2 and so the rank of A is 2. 


A/b = 


-2 2 3\ 

-4 4 l) 

-3 -3 4/ 


Now 


-2 

-4 

-3 


2 3 
4 1 
-3 4 


= 60. Hence the rank of Ajb is 3. 


Since rank A < rank ^4/b there is no solution. 
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Exercise 3.10 Miscellaneous 

1. Find the vertices of the tetrahedron given by the intersection of 
the planes 

x + 2y + 3z = 0 
2x+ y - z = 0 
x + 3y — z = 0 
x+ y+ z = 1 

2. Obtain general solutions for the equations 

x + ay + /3z + y = 0 
x + py + yz + v. = 0 
x + yy + a.z + (} = 0 

in the cases 

(i) <x, /?, y all different. 

(ii) a ^ p = y. 

(iii) a s= jg = y. ( CS) 

3. Solve the following equations completely 

2x + 5 y + z = a 
x + 3y + az = 1 
3x + 87 + bz = c 

In particular, for what values of a, b, c have these equations 

(i) no solutions, 

(ii) more than one solution? (CS) 

4. For what values of a, b and c are the following equations consistent ? 

x+ y+ z = 1 
ax + by + cz = 0 
a z x + b 2 y + c % z = 0 

Solve them completely when they are consistent. (CS) 

5. Investigate the solutions of the equations 

3 ax — 6 y +18 z = b, 4x + 5y — z = —2, x — ly — z = 3 
in the cases 

(i) a = 3, b = 21. 

(ii) a = —17, 6 = 1. (i^D 
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6. Investigate the solutions of the equations 

a 2 x + ay + z = a 2 , ax + y + bz = 1, a 2 bx + y 4* bz = b 
for all possible values of a , b. 

(MT) 

7. If k is given to be real, show that the equations 

kx + y + z = 2 
2 x + ky + z=l 
-4x + y + kz = 1 

have a unique solution, in which x = 2kj(k 2 + k + 2) unless k takes 
one exceptional value. Also show that in the exceptional case there is 
no solution. ( University of Leicester) 

8. Find when the equations 

x + y — 2z = 0 
ax + by + cz = 0 
bx + cy + az = d 

are consistent, and solve them completely when they are consistent. 

(University of Leicester) 

9. The numbers a, b 9 c, a, /?, y are all real. Find (do not merely 
quote) the condition on a , b 9 c 9 that the equations 

ax + by + cz = a 
bx + cy + az = ft 
cx + ay + bz = y 

should not have a unique solution for x 9 y 9 z. 

Separate the condition into two cases, and in each case find the 
conditions on a, /3, y so there should be infinitely many solutions. 

(0. & C.) 

10 . 


A B _C _ D E F 



, 1m 

2m 

! 3m 

lm 1 2m 

2 kg 


X 

\y 

i Z 




5 kg ] 



A light beam AF is loaded with weights at A, D and F. The beam 
is supported by posts at B, C and E. By resolving and taking moments 
form three equations, and solve these to find the thrusts X 9 Y and Z 
in the posts necessary to maintain equilibrium. 
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Determinants and Curve-fitting 

3.12 Determinants can be used to find the equation of a curve or 
surface passing through a given set of points. To illustrate the method 
in its simplest form, consider the equation of a straight line. 


Example 1 

A straight line has the form 

Ax + By + C = 0 

and uses three constants. Its equation is equivalent to 

Px + Q y + 1 = 0 

and this form requires only two constants. Hence any straight line can 
be put in a form using two constants only. A straight line then has two 
degrees of freedom , and it is noted that two distinct points determine a 
line. 

To find the equation of the line passing through a given pair of 
points it is necessary to find the constants P and Q. If the line is to pass 
through the fixed points (xi, yi) and (x 2 , y^) and also the general 
point (x, y) then 

Px + Q y + 1 = 0 
Pxi + Q yi + 1 = 0 
PX2 + Q J2 +1=0 

From these three equations P and Q can be eliminated. 

Consider the equations as a set of three homogeneous equations in 
three unknowns, P, Q, R. 

Px + Q y + R = 0 
Pxi + Qyi + R = 0 
Px 2 + Q J 2 + R = 0 

where R = 1. 


The equations cannot have the trivial solution P = 0, Q = 0, R = 0 
since R = 1. Hence the equations can have only a non-trivial solution. 
This solution exists only if 


x y 1 
*1 yi 1 
x 2 J>2 1 


= 0 


This equation is now the condition that the line Px + Q y + 1 = 0 
passes through the variable point (x, y) and the given fixed points 
(xi, y{) and (x 2 , 72 ). Hence the equation gives, in determinant form, the 
equation of the straight line required. 
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Worked Example 

The equation of the straight line through ( 1 , 2) and (— 7 , 3) is 

x y 1 

1 2 1 =0 

-7 3 1 

=> x + — 17 = 0 

The same technique can be applied to finding the equation of any 
algebraic curve or surface through a set of fixed points. 

Example 2 

A general circle has an equation of the form 

x 2 + y 2 + 2gx + 2fy + c = 0 

This equation is equivalent to 

P(x 2 + y 2 ) + Qx + Ry + 1 = 0, provided P ^ 0 

The equation now involves only three constants. This means that three 
distinct points are necessary to define a circle. Geometrically this can 
be justified since a unique circumcircle can be constructed around the 
triangle formed by the three given points. If the circle is to pass through 
the fixed points ( xi 9 yi ), (* 2 , 72 ), and (X 3 9 yi) and the variable point 
(x, y), then it is necessary to solve the following equations for P, Q and R 

P(x 2 + y 2 ) + Qx + R 7 +1=0 
P(*i + jf) + Q*i + Ryi + 1=0 
P(*i + jD + Q *2 + Rj >2 + 1 = 0 
P (*3 + yt) + Q ^3 + Ryz + 1 = 0 

Again these can be considered as a set of four homogeneous equations 
in four unknowns, P, Q, R, S where S = 1. Again since S = 1* a non¬ 
trivial solution is required. 

A non-trivial solution exists only if 

x 2 + y 2 x y 1 
x\+y\ xi yi 1 
x\ + yi x 2 y 2 1 
X i+A x 3 y 3 1 

This equation gives the condition that the four points (xi, ji), (^ 2 , 72 ), 
(X 3 , yi) and (x, y) lie on a circle. Hence it gives the equation of the 
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circle required. It was required, however, that P^O. Algebraically 
this means that the cofactor of x 2 + y 2 cannot equal zero 

Ji 1 

i.e. X 2 y<z 1 #0 

*3 J3 1 

Geometrically it is seen from the first example that this is the condition 
that (xi, ji), (x 2 , yz) and (x 3 , 73 ) do not lie on a straight line. The same 
technique can now be applied to other problems. 

Worked Example 

Find the equation of a rectangular hyperbola with its asymptotes 
parallel to the coordinate axes, and that passes through the points 
(6, 6), (5, 10), (-3,-6). 

The equation of a rectangular hyperbola with centre the origin and 
with its asymptotes along the coordinate axes is 

xy = c 

If the centre is not at the origin but is at (h, k ) then the rectangular 
hyperbola would have an equation 

(x - h)(y — k) = c 
xy — kx — hy + hk — c = 0 

This equation has the form 

Pxj + Qx + Ry + 1 = 0 

where P ¥= 0 

Hence the equation of the hyperbola required is 

xy x y 1 

36 6 6 1 =0 

50 5 10 1 

18 -3 -6 1 



6 6 1 

36 6 1 


36 

6 1 

1 I 36 

6 6 

=> 

5 10 1 

xy — 50 10 1 

X + 

50 

51 

1 

O 

5 10 


-3 -6 1 

18 -6 1 


18 

-3 1 

1 118 

-3 -6 


=> 48 xy + 96x — 144 y — 1440 = 0 
=> xy + 2x — 3y — 30 = 0 
=> (x - 3 )(y + 2) = 24 
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y 



Note that the coefficient of xy ^ 0 

6 6 1 

i.e. 5 10 1 ^0 

—3 -6 1 

This is the condition that the points (6, 6), (5,0), (-3,-6) do not lie 
on a straight line. 

Worked Example 

Find the equation of an ellipsoid of revolution about the z-axis, with 
centre the origin, and with its axes parallel to the coordinate axes, such 
that it passes through (0, 4, 6) and (3, 4, 0). 

The form of the equation required is 



=> b 2 {x 2 + y 2 ) + a 2 z 2 — a 2 b 2 = 0 
=> P(* 2 + y 2 ) + Qz 2 + 1 = 0 





Linear Equations 


79 


Hence the equation required is 


x 2 + y 2 

z 2 

1 


0 2 + 4 2 

6 2 

1 

= 0. 

3 2 + 4 2 

0 2 

1 


x 2 + y 2 

z 2 

1 


16 

36 

1 

= 0 

25 

0 

1 


36(x 2 + y 2 ) + 

9z 2 

- 900 


X 2 y2 z 2 
=> — -I- — 4-ass 1 

5 2 ^ 5 2 ' 10 2 


Exercise 3.11 

1. Use the determinant method to find the equation of a line passing 
through 

(a) (2,-1) and (-2,3) 

(b) (1,1) and (2, 1) 

(c) (a, 0) and (0, b) 

2 . Use the determinant method to find, where possible, the equation 
of a circle passing through 

(a) (9, -7), (6, 2), (-3,1) 

(b) (0,8), (6,0), (6, 8) 

(c) (1, 1), (4, 7), (-3, -7) 

(d) (0,0), (-1,1), (6, 8) 

3. Use determinants to find, where possible, the equation of a plane 
passing through 

(a) (2, 0,0), (0,3,0), (0,0, 4) 

(b) (4, 1,8), (3,-1,6), (1,-5, 2) 

(c) (14,4,11), (1,2,41), (33,1,7) 

4. A parabola Pi, with vertex the origin, and with its axis lying along 
the x-axis has an equation of the form 

y 2 — 4 ax 

What is the form of the equation of a parabola P 2 whose axis is 
parallel to that of Pi, but whose vertex is the point (h , k)l Express in 



80 


Matrices: Pure and Applied 


determinant form the equation of a parabola with its axis parallel to 
the x-axis, and that passes through the three points 

{x 2 ,yi), (xz, yz) 

Find the equation of such a parabola passing through 
(-1,3), (-4,-9), (-1,-5) 

5. Find in determinant form the equation of a parabola with its axis 
parallel to the y -axis and passing through three fixed points, 

(xi 9 yi), (X 2 ,y 2 ) and (x 3 ,j 3 ) 

Find the equation of such a parabola passing through 

(-1,4), (0,5), (-4,13) 

6. Find in determinant form the equation of a sphere, with its centre 
not at the origin, passing through each of a set of four fixed points of 
the form 

(Xr 9 y r ,zr) 9 r = 1, 2, 3,-4 

What is the geometrical significance of the non-vanishing co-factor of 
the term in (x 2 + y 2 + z 2 )? 

7. Find the equation of the hyperboloid of one sheet of the form 



that passes through the three given points 

(3,4,12), (0,4,0), (5,0,16) 

8. Find the equation of the central conic, centre the origin, with axes 
along the coordinate axes, and that passes through 

(a) (26, 5), (-40, 16) 

(b) (7,48), (20,30) 

(<?) (15, 12), (41,40) 

(d) (4, -2), (-6, -3) 

(e) (15,20), (7,24) 

9. (a) The general second-degree equation in two variables is a conic, 
which takes the form 

ax 2 + 2 hxy + by 2 + 2 gx + 2 fy + c = 0 
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How many points are necessary to determine a conic ? If these fixed 
points are of the form (x r , y r ) where r = 1,2,. . write in determinant 
form the equation of a conic passing through these fixed points. 

(b) Find the equation of a conic passing through 

(0,3); (0,-3); (-3,0); (1,-4); (1,3) 

10. How many fixed points determine a general third-degree equation 
in two variables? Find in determinant form the equation of a cubic 
curve passing through these points. 

11. How many fixed points determine a general second-degree 
equation in three variables. Find in determinant form the equation of a 
quadric surface passing through these points. 

12. Write in determinant form the equation of a four dimensional 
linear surface passing through a set of fixed points. What condition 
must be satisfied for the surface to be determined? 



4 

Matrix Transformations 


4.1 The ideas of simple matrix transformations will all have been met 
with before. (For many examples and details see Matrices /, Matrices II 
and Exercises II in the Contemporary School Mathematics series.) 
We only give here a brief reminder of these ideas. 


Transformations of the plane using 2x2 matrices 

4.2 If a column vector is pre-multiplied by a 2 X 2 matrix then the 
vector will be ‘transformed’ into a new one. If 



is transformed to 



we also speak of the ‘transformation’ or the mapping of the correspond¬ 
ing point, i.e. ( x , y ) to (x' 9 /). 


4.3 We consider the transformation of the points (1,0) and (0, 1), 
i.e. the transformation of the vectors 



Now, any point (a, b ) has associated with it the vector 
may be expressed as 




, which 


Now if M is a 2 x 2 matrix, then we can transform every point of the 
plane using this matrix. Now* 



* Using ideas of matrix addition and multiplication as in Chapter 1. 
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Now 

m(J) and J/Q 

are column vectors representing the images of the points (1, 0) and 
(0, 1) under the transformation. Hence if we can find the images of 
the points (1, 0) and (0, 1) we can find the image of any other point 
(a, b ). To do this it is easiest to use a unit square.* 

4.4 Example . Investigate the transformation of the (x, y) plane 
represented by the matrix 

gd 

We consider, first of all, the transformation of the points lying at the 
vertices of the unit square . This will indicate the transformation of the 
whole plane. 



e dg 


^4 - l \/0 


G " n/I 


i.e. (1, 0) is transformed into (4, 2) 

^ j i.e. (0, 1) is transformed into (—1,1) 
l) (l) “ (3) ** e * ^ * s trans ^ orme ^ into (3, 3) 


The unit square has become distorted as shown. Hence we expect 
the transformation of the plane to be something like this! 

* The unit square has vertices (0,0), (1, 0), (0, 1) and (1, 1). 
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The original plane The transformed plane 


4.5 We can, rather than considering the transformation of the vertices 
of the unit square, consider the transformation of lines (through the 



Consider points of the line y = x , i.e. (1, 1), (2, 2), (3, 3). These are 
transformed into (3, 3), (6, 6), (9, 9)—they still lie on the same line. 
In the same way we can consider the lines y = —x, y = 2x, y = —2x, 
y = i*>y= y = 4x,y= -4x. 

The inner area represents the original whilst the area outside the 
ring shows the plane onto which the position vectors are transformed. 

important: Note that the points of the lines which do not remain 
the same under the transformation ‘close up’ on one of those that does; 











Matrix Transformations 


85 


in fact, the whole plane ‘closes up’ in the direction of this line or lines. 
The lines which remain the same under the transformation are in the 
directions of the eigenvectors of the transformation matrix. These 
eigenvectors are such that 


G !)C)-*G)- w haekeR 

|~Here ^ and Qj are eigenvectors—corresponding to the fact that 
y . == x and y == 2 x remain the same lines under the transforma¬ 
tion.! Eigenvectors are dealt with in detail in 4.22. 


Examples of other transformations 

4.6 (a) Enlargements centred on the origin 




y . 

l 

i 



U t 


/ ""— * 

X 





Positive and negative 
values of k give 
enlargements of the 
triangle as shown 


Original plane 


Transformed plane 


The typical matrix for an enlargement is 



where k e R. 


Associated transformations 

d> —- 



The matrix for this transformation would be 



,keR. 
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Negative values of k will, of course, give a transformed picture which 
will be a rotation together with an elongation. 


4.7 (b) Rotations 

To find the matrix which represents an anticlockwise rotation of a° 
about O. 



and 


e 

e 



b\( 0 


d \1 


(b\ 

\d) 


Hence we find the ‘images’ of the points and under this 

rotation. They are | a 1 and ( —sin a I. The required matrix is then 

\sin a/ \ cos a/ 

( cos a —sin a\ 

sin at cos a/ 
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This can be used to give the formula of cos (a + P) for example since a 
rotation through a + P is a rotation through a followed by a rotation 
through jft, i.e. 

( cos (a + p) —sin (a + P)\ __ /cos /? —sin /?\ /cos a —sin a\ 

sin (a + P) cos (a + p)J “ \sin /? cos /?/ \sin a cos a/ 

and comparing entries 

cos (a + (S) — cos a cos {} — sin a sin 

sin (a + P) = sin a cos + cos a sin P 

4.8 (c) Reflections 

To find the matrix which represents a reflection in the lin ey = x tan 0. 
The image of (1, 0) is (cos 20, sin 20). 



The image of (0, 1) is (sin 20, — cos 20). 
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The matrix representing this reflection is thus 

/cos 20 sin 20\ 
\sin 20 —cos 20/ 


4.9 ( d ) Shears 



A shear in the x direction (which may be to the right or to the left 
depending on k ). 

A typical matrix is 


n 

Note that the determinant must be 1. (Parallelograms on the same base 
and between the same parallels are equal in area.) 




A shear in the y direction (which may be up or down, depending on 
the sign of k ). 

is a typical matrix of the transformation, where ke R. 


n o 
\k 1 
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4.10 (e) Transformation involving a singular matrix 

Consider, for example, the transformation of the unit square by the 
matrix 



The unit square is transformed into part of the line y = \x. 
4.11 (/) Transformation by a general 2x2 matrix 



In the above example, it has been assumed that ad — be > 0. 
What happens otherwise? 

Determinants and the transformations 

4.12 As above, the matrix ^ transforms the unit square in 
general, into a parallelogram. 
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We shall interest ourselves in the ratio of the areas OABC-OA'B'C'. 
Now the area of OABC is one square unit and the area of OA B'C' 
is twice the area of A OA'C. 



Now A OA'C' = OPQR - (A OAP’ + A A'QC' + A C'RO) 

— ad — \{ac + (a — b)(d — c) + bd) 

= \{ad - be) 

So OA'B'C' = ad — be 

If the transformed figure is not as shown, but as below, then the area is 
\{bc - ad). 
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Hence the area of the transformed figure is ±{ad — be). The number 

A = ad — be is the determinant (see 1.4) of the matrix \ .1 

. . la b\ . ' V d) 

IS written I r n-»-io nn-mKar /latoiMviitiao tVio WI n n-ntfirtn * /\f 


c d\ 


This number determines the ‘area magnifi cation’ of 


the transformation, as opposed to linear magnification. 


Successive transformations 


4.13 Suppose two successive transformations are made so that 
(x, y) is transformed into (x' t /), where 


ax + by = jc'1 
cx + dy = y'j 

(i) 

-(: S)GK : ) 

(2) 

and then (x\ y') is transformed into (x\ y") where 


px‘ + qy' = x'J 
rx’ + sy‘ = y'j 

(3) 

-C ^)(r) “ (;-) 

(4) 

Substituting we have ((1) into (3)) 


p(ax + by) + q(cx + dy) = x’) 
r{ax + by) + s(cx + dy) = y’j 

(5) 

and also 


G:)[(: 30]-G : ) 

(6) 


from (2) and (4). 
(5) gives 


{pa + qc)x + {pb + qd)y = x'| 
(ra + sc)x + (rb + sd)y = y’j 


(pa + qc pb + qd\ /x\ _ /x*\ 
\ra + sc rb + sd) \y) \y') 


or 



92 


Matrices: Pure and Applied 


Comparing this with equation (6) we have 

Ip ( a = (P a + <i c P b + 

\r s) \c d) \ra + sc rb + sd) 

which is our rule for multiplying 2x2 matrices.* 

Transformations of the type given in equation (1) are called linear . 


Translations and transformations with 3x3 matrices 

4,14 It is not possible to transform into ^ using 2x2 

matrices. This transformation represents a translation a units in the 
x direction, b units in the y direction. 



A translation T( a , b) of the unit square. 

This translation can be represented in a number of ways 



With 3x3 matrices however, it is easy to achieve since 


Hence 


/I 0 a\ fx\ fx + a\ 
0 1 &)( y\ = .y + b\ 
\0 0 l/\l/ \ 1 



represents a translation through a units in the x direction and through 
b units in the y direction. 

* Hence it is possible to justify the rule for multiplication of matrices by starting 
with a definition of a linear transformation, 
t See the section on Affine Geometry, p. 223. 
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To enable us to combine translations, rotations etc., we can easily 
extend our 2x2 matrices into 3x3 ones. For example, a rotation 
through an angle 0 anticlockwise which is represented by 


/cos 0 

—sin d\ 

/cos 0 

—sin 0 

°\ 

becomes 1 sin 0 

cos 0 

0} 

\sin 0 

cos 0/ 

\ o 

0 

l/ 


A reflection in the x axis, represented by 


/i n\ 

/i 

0 

°\ 

(j ? 

becomes 1 0 

-1 

0 

\0 —1/ 

\o 

0 

1/ 


and so on. We then find the image of 
coordinates (x, y). 


representing the point with 


Combining transformations 


4.15 Example L To find the matrix representing a reflection in the 
line y = x tan a. The idea is to apply successive simple transformations 
to achieve the required result. Hence 


Apply R- a , 

then X, 


and finally 7? a , 


a rotation through an 
angle —a 


a reflection in the x axis 


a rotation through an 
angle a 


( cos a +sin a 
—sin a cosa y 



( cos a —sin a 
sin a cos a 



The matrix representing a reflection in y = x tan a is given by 


( cos a —sin a\ /I 0\ / cos a sin a\ * 
sin a cosa/\0 —1/\—sina cosol) 

( cos a — sin a\/cos a sin a\ 
sin a cos a/ \sin a —cos a/ 


= ( w 
\si 


cos 2a sin 2a' 
sin 2a —cos 2a 


:) 


* Remember matrix multiplication is associative, hence there is no ambiguity 
in writing ABC for the product of the matrices A , B and C. 
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4.16 Example 2. To find the matrix representing a reflection in the 
line y = mx + c . 

Suppose that m = tan a. As before 
Apply 


T(o.-c), a translation of —c in they direction 


f ' -“) 

\0 0 lj 


then 


R-a, a rotation through an angle of —a 


X, a reflection in the x axis 


I cos« sin a 0\ 
—sin a cos a 0] 
{ 0 0 1 / 


p _? » 

\0 o 1 / 


'cos a —sin a 0\ 
sin a cos oc 0 
, 0 0 1 / 


Ra, a rotation of through an angle a 
and finally 

T(o. e), a translation of +c in the y direction 
The required matrix is thus T( 0 . C )R a XR-J'(o. -o* which gives 


\o 0 l/ 


/cos 26 

sin 26 

—c sin 26 \ 

(sin 20 

—cos 26 

c(cos 26 + 1) ] 

\ o 

0 

1 / 


on multiplication. If a solution involving m is required, this can easily be 

m 1 

found since if m = tan 0, then / _= = = sin 0 and ^ = cos 6. 


Vl + m 2 

We can illustrate this as shown. 


v 1 + m 2 


* Remember matrix multiplication is associative, hence there is no ambiguity in 
writing ABC for the product of the matrices A 9 B f C. 




+ 




Transformations of three-dimensional space using matrices 

4.17 To investigate transformations of 3d space, we proceed, as with 
2d space except that this time we consider the transformation of the 
‘unit cube’ with vertices (0,0,0), (1, 0,0), (0,1,0), (0, 0, 1), (1,1,0), 
(1,0,1), (0,1,1) and (1,1,1). 

Note again that translations cause difficulty and we would need to 
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Example 

Investigate the transformation of the unit cube represented by the 
matrix 





/I 0 0\ 







o 1 i 







\o 0 1/ 




0 

(0, 

0,0) 

transforms into 

(0, 

0,0) 

0 

A 

(1, 

0,0) 

transforms into 

(1, 

0,0) 

A 

C 

(0, 

1,0) 

transforms into 

(0, 

1,0) 

c 

G 

(0, 

o, 1) 

transforms into 

(0, 

h 1) 

G' 

B 

(1. 

1,0) 

transforms into 

(1, 

1,0) 

B 

D 

(i. 

0 , 1) 

transforms into 

(1, 

i 1) 

D' 

F 

(0, 

1,1) 

transforms into 

( 0 , 

f, 1) 

F' 

E 

(1. 

1,1) 

transforms into 

(1, 

f, 1) 

E' 



Orthogonal* transformations 
4.18 Definition 

A square matrix A is said to be orthogonal if 

A'A = / or A'^A- 1 


* We assume some knowledge of vectors here, i.e. 

(i) Scalar product of two vectors and is (oc fi) = ay + 

(ii) cose -™ 

(iii) Orthogonality when 6 *= 90°, i.e. a . b = 0. 
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[A' is the transposed matrix of A, i.e. rows and columns of A are 
interchanged.] Since 

AA- 1 = A- 1 A = I 
then AA' — A'A =/ 


Typical examples of orthogonal matrices are 



4.19 Properties 

(a) If A and B are orthogonal we have 

AA' = A'A — I and BB' = B'B = I 

Hence 

(ABXAB)' = AB . B'A' (since (AB)' = B'A ') 

= A{BB')A' 

= AIA' 

= AA' 

= / 

Hence the product of two orthogonal matrices is another orthogonal 
matrix. 

(b) The inverse of an orthogonal matrix is orthogonal and also the 
transpose of an orthogonal matrix is orthogonal. Prove these for 
yourself. 

(c) The determinant of an orthogonal matrix is ± 1, since AA! = I, 

\A\\A'\ = \A\ 2 = |/| = 1 
Hence \A\ = ±1 

(d) Consider a point P (x, y) in a plane. Call the column vector 
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Then 

X'X = (* y) Q =x 2 + / 
which is the square of the distance of P from the origin. 



Apply an orthogonal transformation A to X. Our new co-ordinates 
(xi yi) ^with associated column vector ^j ) = Y, are given by 

Y = AX 

since Y'Y = xf + yf and O remains the same point, 

Y'Y = (AX )'. AX 
= X'A'. AX 
= X'X 

i.e. xf + yf = x 2 + y 2 

Hence under an orthogonal transformation, distance is preserved. 

(e) Another important feature of this type of transformation is that 
the angle between two vectors is invariant (i.e. remains the same). 
Consider 



where 
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a *i*2 + yiy 2 

cos 0 = ..= 

Vxf + y {. Vxf + 


which is also given by 


{X'XJHY'Y}* 


Suppose we now apply an orthogonal transformation A to each of X 
and Y so that 

Z = AX, where Z = ^j 

T = AY, where T = 

Now cos* = {Z ' Z} i{ T ' T} i 

(v4X)'. AY 

~ {(AX)'(AX)} i {(AY )'. (^Y)>* 

X'A'AY 

“ {X'^'^X}*{Y'>4'^Y}* 

X'Y . „ , „ 


{X'XJHY'Y}* 


because A'A = I. 


= cos 0. 


Hence 6 = <f>* and the angle is preserved by an orthogonal transforma¬ 
tion. 

General conditions for a 2 x 2 matrix to be orthogonal 
4.20 Suppose 

-n 

then we require = /, or 

e;)(: 3-e ?) 

so, comparing entries: 

a 2 + c 2 =* 1, ab + cd = 0, h 2 + rf 2 = 1 
If ae.R and ce.R 

then a 2 >0 and c 2 > 0 


For justification, read on. 
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Hence if a 2 + c 2 = 1, 

a 2 < 1 and c 2 < 1 

and — 1 < c < 1 

Similarly for b and d. 

Hence it is in order to let a = cos 0, b = cos <f> 9 c = sin 0, d = sin 
and we have 

cos 0 cos </> + sin 0 sin <f> = 0 
Le. cos (0 -(/>) = 0 

=>9-^ = ^ or J-f=j 

There are only two 2x2 matrices which satisfy our conditions 

/cos 6 — sin 0\ /cos 6 sin 0\ 

\sin 0 cos0/ an \sin 0 —cos 0/ 

The first matrix represents a rotation of 0° about the origin, whilst 
the second corresponds to a rotation of 0° followed by a reflection in 
the new x-axis (i.e. z axis as shown). 

In the first case the determinant is 1, in the second —1. 

Case 1 Case 2 




Quadratic form 

4.21 We define a quadratic form Q(x, y) as 

Q(x, y) = ax 2 + 2 hxy + by 2 

*=(;)•-« 

X' = (x y) 


Let 

Then 
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and 


X'+X = (x v) ( a , * , 

7 \ h b ! \yj 

(ax + hy\ 

= {X ^[hx + by) 

= ax 2 + 2 hxy + by 2 

= Q(^./) 

.4 is called the matrix of the quadratic form Q(x, y). Note that A is 
symmetric (i.e. A' — A). The eigenvectors of a symmetric matrix are 
orthogonal (see below). 

4.22 Example. Find the eigenvectors of the matrix 

-6 4) 

and show that they are orthogonal. 

Eigenvectors were mentioned on p. 85. They are those whose direction 
is unchanged by the matrix transformation A, i.e. 

6 4)GHG) 

x + 2y = 2x| 

^ 2x — 2y = Ay) 

_ x(l — 2) = —2 y 

2x = (A + 2)y 
x = 0, y = 0 
1-2 -2 


and since we do 


not require the solution 


2 2 + 2 
=> (1 - 2)(2 + 2) = -4 
=> 2 2 + 2 - 6 = 0 
=>2 = 2 or 2 = —3 
When 2 = 2 

x + 2y = 2x 
=> x = 2y. 

Any vector satisfying this relation is suitable, e.g. . When 2 = — 3, 

x + 2y = —3x 
i.e. —iy = x 


l 1 \ 


A suitable vector is 
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The scalar product of these two vectors is given by 


(2 1 ) 

Hence they are orthogonal. 




4.23 Prove that this is true in general, i.e. the eigenvectors of a 
symmetric matrix (with unequal eigenvalues (values of A)) are orthogonal. 
Consider only the 2 x 2 matrix 


-u 


Orthogonal matrices have already be defined. If we consider the columns 
of an orthogonal matrix as column vectors, then we notice that these 
column vectors are of unit length and are mutually orthogonal. 


Example . For 


C 


we have 


(! f) 


= 1 and (£ 


-«(- 1 ) - 


Hence each column vector is of unit length. Also 


and 


(-!)-» 


«» 


<* 


Hence the column vectors are mutually orthogonal. 
Now if the eigenvectors are 
( 2 \ 


(i) and U) 


we can then make them of unit length by using instead 


i.e. 



1 


and 


and 


V12 + (-2) 2 
-2 
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Now consider the matrix 

V 

Clearly V'V = /, and so V is orthogonal. 

4.24 Prove that it is possible to write 

AV = VD where A = ^ ^ 

V is an orthogonal matrix as above, and D is the diagonal matrix 



where Ai and I 2 are the eigenvalues. 

A transformation of (x, y) into (xu yi) by the transformation 
X = TX 1 transforms a quadratic form with matrix A into one with 
matrix TAT. Now 

XAX = Q (x,y) 

Substituting 

(TXi)'ATXi = X'TATXi 

which is another quadratic form of matrix TAT. Suppose we can choose 
T so that TAT is a diagonal matrix £ with 

«-ca 

and so 

= ei(xi ) 2 + e 2 (x 2 ) 2 
Therefore we choose T = V, then 

T'AT = V'AV and V'AV — D* 
also X = rXi or X * KXi 

so Xi = V'X 

Hence a linear transformation Xi = V'X produces a new quadratic 
form ^i(xi) 2 + A 2 (_yi) 2 , where fa, fa are the eigenvalues of A. 


Ivi Vs 

\\/5 V5‘ 


* See previous page. 
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4.25 This is most useful in the investigation of conies. The application 
of the linear transformation X 1 = V'X transforms the equation of a 
conic X'AX = k into a conic of the form Ai(xi) 2 + fafa) 2 = k 
provided that V represents a rotation (see p. 86) (i.e. Fis not symmetric). 


Example 

Investigate the conic x 2 —4 xy — 2 y 2 = 3, 
Q(x, y) = x 2 — 4 xy — 2y 2 

= (x y) l 1-2 
-2 


(-! :■ 
■ U :5 


Here A 

The eigenvalues are 

2 = 2 and A = —3 
and so the conic is transformed into 

2xf — 2>y\ = 3 

by the application of the transformation Xi = 
V is easily found as 


V'X. 



: be symmetric), and 

/2 

1 \ 

, M 

V5\ 

1 

2 

\V5 

V5/ 


Finally, the transformation is given by 
Xi = F'X 

2 1 


i.e. 


> , '-T5 x + W 


. z 

This represents a rotation through the angle cos -1 
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Hence the conic 

x 2 — 4 xy — 2 y 2 = 3 
can be transformed, by rotating the axes, into 

2xf — 3 y\ — 3 

The new conic is 

4 A _ j 
I ” l " 1 

which is of the form 

X 2 4—1 

a 2 ~b 2 ~ 1 

and so represents an hyperbola with centre the origin, and asymptotes 

xi = ±Vfji 

We may sketch the curve thus 



4.26 The non-homogeneous quadratic form is given by 
Q(*,j) = a* 2 + 2hxy + by 2 + 2 gx + 2 fy + c 
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This can be written in the form 

X'AX + BX+C, where B = (2 g If) 

Application of a rotation by an orthogonal matrix, followed by 
completion of the square will reduce these conics to a standard form. 


Example 

Investigate the conic 

x z — 4xy — 2 y 2 + 6x — 4y + 3J = 0 
As above this becomes, 


(xy) 


U :^) +<6 - 4 ) a +31=0 


We now apply Xi = VX which transforms the conic to be 
2x\ ~ 3yf + (6 - 4) 


V5 

Vs Vll 


+ 3i = 0 


=> 2x? - 3y? + ^=xi - -^yi + 3J = 0 

“ 2 (« + ^) ! - 3 (^ + 3 -^) S - 3 =° 

Which is the standard form for a conic. 

It can be written as 




(f) 2 

We see from this, that if 


12 


= 1 


n + vs ~ x 

» + jjs = Y 


then the conic is in the form 


X*_ Y*_ 

fl 2 b Z ~ 1 
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The transformations 

« + ^ = X. X. + J^ = Y 

are equivalent to a translation. 

Hence the conic is an hyperbola with centre 

(~ V5 _ 3Vl) 

It can be drawn graphically as follows, as in the simpler example above. 
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The point j is subjected to the transformation T, followed by the 

»-C -5' 


* 

further transformation 


Show that the result is equivalent to a reflection in the line y + x = 0. 
Find the equation of the line resulting from the transformations 7i and 
T on the line y — 3x + 2. {MEI Specimen) 


2. Show that the 2 X 2 matrix M a which reflects each point of the 
xy plane in the line y = x tan a is 

( cos 2a sin 2a\ 
sin 2a —cos 2a/ 

If M 0 represents a reflection in the line y = x tan /?, show that M 0 M) 
represents a rotation of the plane, and find the angle of rotation. Find 
the relation satisfied by a and /? if = M a M^. {MEI Specimen) 


3. Describe how complex numbers may be represented by 2 x 2 
matrices. Find A 6 where 


A = 




(Part MEI Pure I) 


4. If 




find det M in terms of k . 
(i) If k = 2, solve 


Mx = O and 


Mx= I s\ 

w 


(ii) If k = 1, solve Mx = O and interpret your result geometrically. 

{MEI Mathematics I) 

5. If M denotes the matrix and / denotes the 2 x 2 unit 

matrix, prove that M 2 = 3 M — 21. 

* [See Matrices 2 Chapter 2.] 
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Prove further that, if n is any positive integer 

M n = (2» — 1) M — 2(2 n_1 — 1)/ (MEI Pure I) 


6. (a) If 


/I 

2 

_1 \ 

2 

4 

-5 

\3 

7 

-9/ 


Find m and A~ l . 

(b) Write the transformations 

X = x + 2 y — z\ X = a + b — c| 

Y = 2x + 4y — 5 z| Y = 3a — 2b + c| 

Z = 3x + 7 y- 9z) Z = 4a + 3 b + 2c) 

in matrix form and hence, by using matrices, express x,y, z 
(i) in terms of X, Y, Z, (ii) in terms of a, b, c. 

7. If B and C are square unequal matrices of the same order and / is 
the unit matrix, why does 

(/ + B){I - B) = (/ - B)(l + B) 


whereas (/ + B)(I — C) is not in general equal to (/ — C)(7 + B)1 If 


show that 


/ 1 

c 

-b\ 


l-c 

1 

a 

and B — I 

\ b 

—a 

1/ 


AA' 

= (/+*)(/ 

II 

o? 

1 


where A' is the transpose of A. 

8. Investigate the transformation of the unit cube by the matrix 



and show that the volume of the transformed solid is given by 


± 


9. Investigate the conics: 


a b 
d e 
g h 



(i) 9xy - 1 = 0, 
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10. Investigate the conics: 

(i) x 2 + 4 xy + Ay 2 + 2x + 4y + 1 = 0, 

(ii) x 2 + 2y 2 - 2x + Ay - 1 = 0. 

11. If a rotation of the coordinate axes about the origin transforms 
the equation 

ax 2 + 2 hxy + by 2 = 1 

into the equation 

a'x' 2 + Ih'x'y' + b'y’ 2 = 1, 

prove that 

a' + b’ = a + b and a'b' — h' 2 = ab — h 2 (O&C) 

12. If the equation of the parabola 

(x sin 6 + y cos 6) 2 + 2 gx + 2fy + c = 0 

is reduced to the standard form Y a = kX by a change of coordinate 
axes (leaving the unit of length unaltered), prove that 

k = 2 (f sin 8 — g cos 6) 

Hence, or otherwise, find the distance between the focus and the 
directrix of the parabola given by the equation 

9x 2 + 24 xy + 1 6y 2 — 10* + 20y = 0 (O&C) 

Exercise 4.2 Miscellaneous 
1. Let A(6) and B(6) denote the matrices 


/cos 0 

sin 0\ 

/cos 0 

—sin 0\ 

\sin 0 

—cos 0/’ 

\sin 0 

cos 0/ 


(i) If 

show that the point (X', is the mirror image of (X, Y) in the line 
0 

y = xtan^ 

(ii) Prove that A(6{)A(0 2 ) = B(<j>) where <f> is to be determined 
and hence explain the relation between the points (X', Y') and (X, Y) 
when 



Matrix Transformations 


111 


(iii) Prove that 

A(0 1 )A(e 2 )A(6 3 ) = A(d 3 )A(0 2 )A(d 1 ) 
and interpret this result geometrically. (CS) 


2. (i) Find a 3 X 3 matrix P such that, for any 3x3 matrix Q, 
PQ is the matrix Q with its second and third rows interchanged. 

(ii) The direction cosines of a line in 3d Euclidean space are 
(/, m, n). Interpret geometrically the linear transformation 


where 


X = Lx 

IP Im In ' 
L — [ml m 2 mn 
\nl nm n 2 > 


Prove that the transformation L 2 is the same as the transformation L. 

t MErsr ) 


3. The equations of motion of a particle in a plane, referred to 
rectangular axes Ox, Oy in the plane are x = ky, y — —kx. Show that 
the equations of motion become Jc' = ky', y = —kx’ and are thus 
unaltered in form when referred to axes Ox', Oy' derived from the 
first by rotation about O through an arbitrary angle a. 

Are the equations of motion unaltered in form when referred to 
axes Ox", Oy* derived from Ox, Oy by reflection in an arbitrary line 
x y 

coTfi = s in t h rou §h origin. Similarly investigate the equations: 


x = k(xy — yx)y, y = —k(xy — yx)x (CS) 

4. If X’ denotes the transpose of a matrix X and A and B are 3x3 
matrices, prove that 

(A + B)' = A' + B' 

and (A.BY = B’A' 

Deduce that the angle between any two vectors in 3 d Euclidean space 
is unaltered when the space is subject to a transformation Y = AX 
where A is a 3 x 3 matrix such that A'.A = I.* (MEI mod.) 


5. Let A, B, C be real 2x2 matrices and write 
[A, B] = AB — BA, etc. 

Prove that 

(i) [A,A] = 0, 

(ii) [[A, B], C] + [[. B, C], A] + [[C, A], B] = 0, 

(iii) [A, B] = I => [A, 5™] = mB m_1 for all positive integers m. 


* See section on orthogonal transformations. 
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At each step you should state clearly any properties of the matrix 
ring which you use, 

The trace Tr (A) of a matrix 


is defined by 
Prove that 



Tr (A) = an + a 22 


(iv) Tr (A + B) = Tr (A) + Tr (B), 

(v) Tr (AB) = Tr (BA), 

(vi) Tr (I) = 2. 


Deduce that there are no matrices satisfying [4, B] = /. Does this in 
any way invalidate the statement in (iii). (MEI Pure III) 


6. The number an + a 22 + a 33 is called the trace of the matrix 


Ian 

£12 

£13\ 

la 2 i 

#22 

£23 I 

w 

£32 

£33/ 


If A and B are two 3x3 matrices, show that the traces of the matrices 
AB and BA are equal. 

If the matrix AB represents a rotation through an angle <f> about the 
directed axis U and A represents a rotation interchanging the axes 
U and V, explain why BA represents a rotation through the angle 
<l> about V. 

Given that the matrix 

( cos cf> sin <f> 0\ 

— sin <j> cos <j> 01 

0 0 1 / 

represents a rotation through the angle <f> about the z-axis, and that the 
matrix C represents a rotation about some axis, find a formula for the 
angle of rotation in terms of the trace of C. (CS) 


7. What is meant by the statement that multiplication of real numbers 
is associative and distributive over addition? 

Prove that, if a#, by, aj (*,/= 1, 2, 3) are real numbers then 
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Deduce that multiplication of 3 x 3 matrices is associative and that, 
if A is a 3 x 3 real matrix, then the notation A 3 , A\ A 5 . . . may be 
interpreted unambiguously. If the 3 x 3 real matrices B and A 3 + A + I 

are non singular (/ is the identity matrix) why should the notation — be 

ambiguous but — + - not so? (J VE IPure III) 

8. Let A be a 3 X 3 real matrix and b a 3 rowed column vector. It is 
proposed to solve the equation 

Ax = b, 

and a particular solution x = x 0 is noted. Prove that any other solution 
may be written in the form 


x = x 0 + u 

where u is a solution of the equation Ax = O. Prove conversely that 
any vector x„ + u is a solution of the equation where x 0 is a fixed 
particular solution of the equation and u is any vector such that Au = O. 
Interpret geometrically the equation and its solutions if A is singular 
(i) when an x« exists, and (ii) when there is no such particular solution x a . 

(MEI Pure III) 

9. A function A maps the ring of real 2x2 matrices to the real 
numbers by assigning to any real 2x2 matrix A , its determinant A A. 
Prove 


(i) A (AB) = AA.AB for all A, B. 

(ii) A (A + B) # AA + AB in general. 

(iii) Under matrix multiplication, the set of matrices A such that 

AA = l forms a group, but under matrix multiplication and addition 
this set does not form a ring. (SMP mod.) 

10. At time zero an observer at the origin O of one coordinate system, 
(x, y, z, t) coincides with an observer at the origin Oi, of another 
(xi, yi, zi, ti). At this time, t = ti = 0, an instantaneous light signal 
is emitted. 

After time t the first observer describes the equation of the wavefront 
as 

x 2 + y 2 + z 2 — c 2 ! 2 = 0 (i) 

where c is the velocity of light. 

Similarly the second observer describes the equation at time ti as 

x?+^ + zf-c2f? = 0 (ii) 



114 


Matrices: Pure and Applied 


Now the two following postulates are fundamental to the special theory 
of relativity. 

I. A description of a physical phenomenon can contain no 
reference, either explicitly or implicitly, to the absolute speed of the 
coordinate frame in which the phenomenon is described. 

II. The speed of light is independent of the motion of its source, 
and is the same for any two observers who are travelling in uniform, 
rectilinear motion. 

Suppose that the relative motions of the two coordinate systems are 
directed along a common x-axis. 

The Galilean transformation from one system to the other is given by 



1 0 0 -V 
0 10 0 

0 0 1 0 

0 0 0 1 



(hi) 


where V is the velocity of Oi relative to O. 


By the two postulates above each observer must find the same form 
of the equation of the wavefront is his own coordinate system. 

(a) By direct substitution of the Galilean transformation (iii) in 
equation (ii), show that this cannot be the case. 

(b) Show that substitution of the Lorentz transformation 



1 c - 

0 

0 

-cV ^ 

Xl\ 

[ Vc 2 - V 2 

Vc 2 — V 2 

n 1 - 

0 

l 

0 

0 

Zi 1 

0 

0 

1 

0 

til 

i - V .. 

0 

0 

c 


\cV c 2 — V 2 

< 

% 

1 

< 



in equation (ii), then the form of the equation does remain the same. 

(c) Find the inverse Lorentz transformation of the x, y, z, t system 
into xi, yi, zi, t±. 

(d) Show that the transformation in ( b ) above could be described by 



Vc 2 - V 2 

0 

0 

-yy 

Vc*— V 2 


0 

1 

0 

0 


0 

0 

1 

0 


yv 


Vc 2 - V 2 | 

0 
0 


Vc 2 - V 2 / 



where the matrix of the transformation is orthogonal, and y' 2 = —1. 



Matrix Transformations 


Hence prove that 

(XI yi Zi jcti) [ x i \ =(x y z jet ) f x 



11. (i) 1fp,qeR, then let the function 

f{x) ~pe** + q(** 

where a =£ 0, b # 0, be represented by 

(i) Show that ^ ^ represents fix). 

(ii) Show that $f(x) dx can be represented by 

+ c 

(iii) What relationship exists between 

(S ») “ d 

Civ) Find g\x) if g(x) = 3e® + 2<r*. 

(v) Find f n ix). 

12. If p,qeR, then let the function 

fix) = pe°* cos bx + qeP* sin bx 

be represented by 

(3 

0) Show that f\x) can be represented by 



/ a b 
\-b a, 


(ii) By considering the inverse of the matrix 
/ a t\ 
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interpret the expression 

_J_/a -MM 
a 2 + M a/ W 
and prove your hypothesis. 

(iii) Use your answer to (i) to find 

d 

— [e 2x cos 3x], 

(iv) Use your answer to (ii) to find 

$e 2x sin 2x dx. 



5 

Powers of Matrices 


5.1 In nearly every situation involving matrices, one problem seems 
common—that of finding the powers of the matrices. A number of 
different methods can be used, and we illustrate some of them. For 
simplicity we shall begin by considering 2x2 matrices and extend 
our ideas as necessary. 


Eigenvalues and eigenvectors 

5.2 We have seen in the previous chapter (p. 84) that under a matrix 
transformation the direction of certain column vectors is unchanged; 
these are called the eigenvectors (or latent vectors or characteristic 
vectors). We consider the problem algebraically here. 


Example 

Are there any column vectors whose direction is unchanged under the 
transformation represented by the matrix 

U -5 - 

We need a column vector such that 

u -?)(;)=*(;) 

o 4x + 2 y = kx\ 

-3x - y = ky) 

(4 - k)x = -2 y ) 

—3x = (k+ l)yf 

117 
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These equations are consistent if and only if 

4-k -2 


—3 k+ 1 
o (4 - k){k + 1) = 6 

o k 2 - 3k + 2 = 0 

o (k- 2)(k - 1) = 0 

o k — 2 or k — l 

(i) k — 2 gives, from (4 — k)x — — 2y , the relation x — — y . 

(ii) k — \ gives 3x = —2y. 

We test a particular column vector such that x = — y to check. 

Suppose, for example, we consider Then 


and in general 


U 4(-0-(-l) 
u-(.i) 


where m is any scalar. Hence if we can find one column vector which 
satisfies our requirements, then any scalar multiple of it will do as well. 

For the relation 3x = —2y we can try the column vector (_ 3 ^- 

- U J)UH(J) 

5.3 In general , numbers k such that 

c xh(;) 

are the eigenvalues of the matrix ( ^) and the corresponding eigen- 

/x\ a ' 

vectors are the values of I 1 for which the equation holds. Remember 
that the eigenvectors are by no means unique. Note also that some 
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matrices have no real eigenvalues. We can consider the problem in 
general. If 


(: XHC) 

[ wh " e (o ?)] 


o(A - kl ) = O 


( 


a — k 

c 


b 

d-k 


= O 


(a — k)x 


cx 


+ by =01 


These equations are consistent if and only if 

a — k __ b 
c d — k 

O (a - k)(d - k) - be = 0 
a - k b 

o A == 7 = 0* 

c </ — k 

or rearranging 

k 2 - (a + d)k + ad - be = 0 

, ^ . (a b\ „ 

5.4 This is called the characteristic equation of the matrix I A. If 

H:$) 


then 


so that 


(a 2 + be b(a + ^)\ 


-c 

{a + d)A = 

A 2 - (a + d )A + (ad - be )I = 0 


+ d) be + 

(a 2 + ad ab + b< 
i ac + cd ad + d 2 


$ 


(i) 


Hence the matrix A itself ‘satisfies’ its own characteristic equation. 

* n.b. This is the condition for the two non-homogeneous equations to have a 
non-trivial solution. 
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This may be extended to larger square matrices and is known as the 
Cayley-Hamilton Theorem . Note that this is not a proof of the theorem, 
just a demonstration that it is true in the case of 2 x 2 matrices. 

5.5 Equation (i) is most useful for finding powers of matrices. 
Suppose we rewrite the equation as 

A 2 - eA +// = O 

i.e. A 2 = eA — //, 

then A 3 — eA 2 + jA = O 
so that A 3 — e(eA —//) + jA = O 
i.e. A 3 = (e 2 — f)A — efl. 

Also A 4 - eA 3 +fA 2 = O 

so that A 4 — e{e 2 — f)A — e(—efl) + f(eA —//) = O 

i.e. ^4 4 = ( e 3 — 2ef)A + {J 2 — e 2 f)I , and so on. 

This means that, in general, the powers of a matrix may be expressed 
in terms of the matrix itself together with a multiple of the unit matrix, 
i.e. 

A n = gA + hi (ii) 

where g and h are scalars which have to be determined (this is in general 
an easy problem as we shall see). 

5.6 Relation (ii) is particularly simple in the case where the eigen¬ 
values of the characteristic equation are equal. 

Suppose that the equal roots are k = A. 

Then the Cayley-Hamilton Theorem (for 2x2 matrices) gives 

A 2 - 2 IA + X 2 I = O 
and A 3 - 2XA 2 + X 2 A = O 

and A 4 - 2 XA 3 + X 2 A 2 = O 

and so on (multiplying each equation by A). On substitution we have 

A 2 = 2XA - X 2 / 

A 3 = 3X 2 A — 2X 3 I 
A 4 = 4 X 3 A - 3A 4 / 

and in general 

A n = nX n ~ l A -(n- 1 )X n I for n > 2 

which may be proved easily by induction. 
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5.7 Note that the Cayley-Hamilton Theorem also gives us another 
method for finding the inverse of a matrix. For suppose 

A 2 - eA +fl = O 

Then multiplying through by A~ x we have 
A - el + fA* 1 = O 

so that fA~ Y — el — A 

A-i = j {el - A} 


5.8 Eigenvalues and eigenvectors are used to find the powers of a 
matrix in the following way. Continuing with our initial example 
where 



and remembering that the eigenvalues were k = 2, with corresponding 
eigenvector jj, and k = 1, with corresponding eigenvector 
we now form a new matrix 



17 = 




with the eigenvectors as its columns. 

It is an easy matter to show now that 


U 4K-! 4H-i 4)6 ?) 

f i -f 

This eigenvector corresponds to this eigenvalue- 1 


or in general 

AU= UD 


where D is a diagonal matrix which has the eigenvalues in its leading 
diagonal and zeros elsewhere. Now if 

AU= UD 

then A = UDZJ- 1 

and A 2 = UDU^ . UDU- 1 

= UD 2 U~ y 

A 3 = UD 2 !/- 1 . UDU- 1 
= UD^U - 1 
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and so on, giving 


A n = UD nu -i 


Now D n is easy to evaluate since 

Mo :)• Mr !.)• Mr ?.) 

/ 2 < 0 \ . „ / 2 * 0 \ 

“(o I 4 /* “ dS ° 1> "“ (.0 1 "J 


D* 

Hence, for example, 

A 6 = UD*U~' 

-x hu -i) 

-u 4(“-5 

-(-2 -SU -5 

_ / 190 126\ 

" V —189 —125/ 

5.9 This method is usually the easiest one to apply; however there 
are problems. In the above example U was a non-singular matrix and 
hence t/ _1 exists, but this is not always the case. If the eigenvalues are 
equal then the eigenvectors are equal or one eigenvector will be a 
scalar multiple of the other, e.g. 


and 


= 2 


This leads to 


MM 


and U i$ Singular, so that t/ -1 does not exist. In this case we can use 
a variation on our original ideas, using the Jordan Canonical Form . 


5.10 Example . To find the nth power of the matrix 
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The characteristic equation is 

ifc 2 - 6k + 9 = 0 
o k = 3, or k = 3 
The idea then is to write 


where 


B = FAF -1 (or £F = FA) 



i.e. a matrix containing the eigenvalues as the leading diagonal with a 1 
in the top right hand entry. 

Then suppose F is the matrix 

n 

and we have 

u :kc % DU D 

where A ~ ad —be 

so 

/ 2 1\ _ l/3a a + 36\/ d —6\ 

\ —1 4/ A \3c c + 3cf/\—c a/ 

_ 1 /3arf — ac — 36c a 2 \ 

A \ —c 2 — 36c + ac + 3arf/ 

By comparing entries we have 

2A = 3 ad + ac — 36c A = a 2 

4A = —36c + ac + 3a<i —A = —c 2 


Effectively we have three equations, 

A = ac 
A = a 2 
-A = -c 2 


Since F is by no means unique we chose the easiest values for A, 
a and c. Take A = 1, then a = 1, c = 1 are possibilities. 

Since A = 1, ad — be = 1 so that, if a = 1, c = 1 then — 6 = 1 
and we could allow for example = 3, 6 = 2. 
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In this case 


To check 


= (! 3) and 1) 

™~-(! 3)6 J)(_! D 

1) 


■ (3 ,o)(- 

=u:) 


= B. 


As previously 


B n = FA ny- 1 
and our problem now is to find A n 
A 


A3 


A 4 


■ c s- 

/3 3 27\ 

“ \0 3 3 /’ 

and so on. The pattern is much easier to recognize in the following form. 

A»=P 23 ) 

'() 3 a I 


- (o ! y 

-c 


3 4 108 
0 3 4 


) 


A 3 


A5 


In general 


- C 3- 

_ /3 3 3.3 2 \ 

“ \0 33 ) 

_ / 35 5 ' 34 \ 
\0 3 4 / 


A 4 


__ /3 4 4.3 3 \ 
” \0 3 4 / 


A » 


_ /3” «3 n_1 \ 
“ \0 3” / 


This may be proved easily using induction. We are now able to find 
the powers of any 2x2 matrix provided that it has real eigenvalues. 


Use of Cayley-Hamilton theorem in calculating powers 
5.11 Suppose that the matrix A, ‘satisfying’ its characteristic equation 
A 2 + IA+ml = O 
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can also be expressed in the form 

A = UDU - 1 

where U and D are non-singular matrices as explained. By substitution 
in the characteristic equation for A we have 

UDU- 1 . UDU- 1 + l . UDU- 1 + ml =0 
or UD 2 .U- 1 + 1. UDU- 1 + ml = 0 

Multiply through this equation in front by U _1 and behind by U, 

U-WDW-W + l. U~ 1 UDU~ 1 U+ mU-HU = O 
i.e. D 2 + ID + ml =0 


Hence the diagonal matrix D also ‘satisfies’ the characteristic equation 
of the matrix A. This enables us to calculate powers easily and quickly. 

/4 — 3\ n /4 —3\ 

Example. To find L ^ I . Call L ^ I the matrix C. 


The eigenvalues are k = 3 and k = 1. 
Now using the Cayley-Hamilton theorem, 

C n =pC + ql 

and similarly D n = pD + ql (as above). 


so that 


Hence 


^ by comparing entries. 


3 W — i n 3-3” 

p = o 9 ? — i 


C n = 


2 

3” - 


1 14 —3\ 3 — 3” /I 0\ 

\i o; + 2 \o i/ 


and the problem is solved. 


5,12 An easier method for finding the powers of matrices, which works 
in the cases both when the eigenvalues are different and when the 
eigenvalues are equal is as follows. 


Theorem 



9 

s — fa 



h n 

0 


h n ) 


where k n = ki n_1 + h n ~ z h + Ai» _3 A| + . . . + fa™- 1 
This may be proved by induction. 
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Suppose the formula is correct then 

— h q 

s — X%) 


e r-*4 To v) 

« e t p ' h ,u)T~ h ,-*j(o d 

This may be proved since Ax, A 2 are the roots of the characteristic 
equation A 2 — (p + s)A + ps — qr = 0. 

Hence 

/> ?\” +1 _ k (p-h ? \P 2 °\ , (P q\(h* 0\ 

V sj ~ kn \ r 5 -A 2 AO Ax/ \r 0 A 2 B J 

=('7* si,Mo i) + (T 

But fen (o Ai) + ft a!») = fc ” +1 (o l) 

{since k n +i = Ai“ + Ai b-1 A 2 + . . . A 2 B } 

Ip q\ n+1 , (P — A 1 ? \ . /Ai n+1 0 \ 

S0 \r s) ~ r 5 - A 2 ) + 1 0 A 2 B+1 J 

This is clearly true for n = 1 and hence n = 2, 3 ... 


Ax» +1 0 

A 2 b+1 


) 


Exercise 5.1 

Questions 1-4 involve matrices with distinct eigenvalues, Questions 
5-8 involve matrices with equal eigenvalues. 

(5 2\ 

1. If M is the matrix I ^ 4, find the values of 

(i) M\ (ii) M n . 

2. Find the value of P n where 



and find the limit of P n as n —► oo. 

3. Find Q 6 and Q n where 


-* 1 
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4. Find R 7 and R n where 


5. Find S 5 and S n where 


-G5) 

-c -:) 


6. Find T 4 and T n where 


-6 — 2 ^ 

2 


r=( 

7. Find W 6 and where 

fF 

8. Find X 7 and X n where 

X 

9. If ^4 = /—2 — 9\, prove by induction that 


3 
=Li i) 
=U -1) 


A« = ll - 


n d- 

( 1 — 3« —9« \ where n is a positive integer. 

n 1+3 n) 


Write down the matrix which is Lim - A n . 

«—► 00 ft 

10. Prove that, when n is a positive integer 


/cos 6 

—sin ey 

/cos nd 

—sin n6\ 

\sin 6 

11. Prove that 

cos O) ~ 

\sin n6 

cos nd] 

(a 

b \» 


(a b \ 

\ka 

12. Show that if 

kb) =(a + 

kb )*- 1 . 

\ka kb) 


then 


b sinh a\ 
cosh a f 

cosh na b sinh na\ 
sinh na cosh na 


/cosh a 
U- sinh a 


(MME) 
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An iterative process for finding y/2 using matrices 

/I 2\ 

5.13 Consider I I and guess the square root of 2 as 4 say, i.e. our 
V 1 /A\ 

first guess go = 4, which we write 


Then 

Now 


90 -0 

and 

6 

gi = 5 = 1-2 

90 - GO 

and 

16 

82 = IT " 145 

9G9-0) 

and 

38 

g 3 = 27 " 1-407 

2 \ / 38 \ / 92 \ 

1 / \ 27 / “ \ 65 / 

and 

92 

S 4 = 65 « 1415 


It is an interesting exercise to prove that the sequence go, gi, g 2 , g 3 , g 4 
. . . is converging to Vl* 

This is an interesting side of the problem of finding eigenvalues by 
iterative procedures. In the above case one of the eigenvectors will be 

found to be 


To find the eigenvalues and eigenvectors using iteration 
5.14 Consider again ^ j with our first guess as Then 

C M-p-’Of) 

Instead of applying ^j to as before, we use 

c so-o-p-n 




/I 2Wl-407\ /3-407\ /1-415\ 

(l l)( 1 ) "( 2 - 407 )" 2407 ( 1 ) 


and so on. 

* A solution is found in Mathematical Prima No. 19 (from St. Dunstan’s College 
Mathematical Society.) 
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The sequence 5, 2-2, 2*45, 2*407 converges to give the eigenvalue 
whilst the other sequence 4, 1*2, 1*45, 1*407, 1*415 converges to give the 
corresponding eigenvector. 


5.15 Justification . Consider a guess gi, say, then 


gi = CiXl + c 2 x 2 

where xi and x 2 are the two eigenvectors corresponding to Ai and fa 
(the eigenvalues) and ci, c 2 are scalars. Now 


so 


Agi = ci^4xi + c^4x 2 and ^4xi == AiXi 

Ax2 = A 2 x 2 

^4gl = ClfaXi + C 2 A 2 X 2 

(fa fa \ 

= m — cixi H-c 2 x 2 

\m m ] 


= mg 2 

where m was chosen as shown in the example above. 

If fa > fa the contribution from xi to the guess g 2 is less important 
than it was in gi and eventually x 2 is dominant (provided that c 2 ^ 0). 


To find the other (smaller) eigenvalue 

5.16 Remember that fa + fa = a + d (the trace of the matrix—the 
sum of the elements in the leading diagonal) or since 

Ax — Xx 
=> A —1 x = | x 

use A- 1 to give the larger value of j or the smaller value of A. 

5.17 This idea leads to an iterative process for solving quadratic 
equations. 


Example 

Solve the equation 

A 2 — 5A + 2 — 0 

Consider this equation as the characteristic equation of some matrix 
A say. Then if 



we have a + d = 5, ad — be = 2. 
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Let a = 5, d = 0, b = 2, c = — 1, this is one set of possible values 
for a , b , c and d. Then 



We then proceed as before to find the eigenvalues (the roots of our 
equation) and the eigenvectors. Consider as a first guess 


*■(!) 

«- U:)(!)-(-!)-'CD 

try g2 = 

U DO O'CD 

u Dcn-c”)-»cr") 

u dch-(■“)•*»(-;*) 


It is now clear that the largest eigenvalue lies between 4*56 and 4*57. 
The smaller one can be found easily since the sum of the roots is 5. 


Check 

Using the formula for the solution of quadratic equations 

, 5 ± V25 -8 

2 

5 ± VT7 
~ 2 


The largest value of A is thus 


5 + VT7 


5 + 4-123 

** 2 

m 4-561(5) 


2 
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Further consideration of the problem of finding roots using 
matrices.* 


5.18 The difficulty of the process shown is that the series does not 
converge to give V2 very quickly. However it is worth considering for 
yourself what happens if the first guess is a wild one (4000 say). This 
process does not give the negative square root. 

Consider again 


(i 3‘-G 3 

13 4\ 2 _ m 24\ 
\2 3/ “ \12 17/ 


Mean of f and I 
= 1-415 

Mean of and if 
= 1-4142 


Consider also 


(5 2\ z _ 111 20\ 

\1 5/ “ \10 27/ 

'27 20\ 2 _ /929 1080\ 
,10 27/ ~ \540 929/ 


Mean of Hand I? 
= 1-72 


Mean of H2 and \W 
= 1-442 


This process should be compared with the usual iterative process for 

finding square roots, i.e. x r +i — i 

How could we find V— 2? 

Consider 



The ratios clearly tend to V2 . j. 


Examples of other uses 
1. To find 2* 



X guess at 2* 


* We are indebted to Alastair McDougall for these ideas. 
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A 2 = 



We allow no leeway in 2° 
or 2f 


New guess iff + I + ir] 


1-4 

1-33 

1-25 


3)3-98 


The mean is 1-33 


/184 232 292\ 
A 4 = l 146 184 2321 
\l16 146 184/ 


Our estimates of 21, 2§ are now iff, iff respectively. Three estimates 
for 21 can be got from A 4 


iff = 1-2586207' 
if# = 1-2602740 
iff = 1-2608696, 


The Arithmetic mean gives 
21 = 1-2599214 
(An error of 3 in the last digit) 


2. To find 51 



Our estimates are 

if = 1-357) 

if = 1-5791 The Arithmetic mean is 1-501 
ff = 1-667 ( (An error of 0-006 or 0-4%) 
if = 1-4001 
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3. To find the complex cube roots of 1 say, it is necessary to ‘prime’ 
the process by feeding in complex or imaginary estimates. 

We choose j, as our estimate for l£, — j, for It, 

-j 1 A* /—1+2 \j -1-2 j 3 y 

J ~j 1 = 3 -1+2 j -1-2; 

1 j ~jJ \-l-2 j 3 — 1 + 2;/ 

1-9 _ 16; 19 ... 

= I -9 + 16; ... 

\ 19 

_ .. . „ -9 + 16; -9 - 16; 19 

Taking the mean of-^-, ZT+I6j’ -9 - 16; 

we get —0-5001 + 0-8663; (giving an error 

of about 0-03%) 


Exercise 5.2 

Use the iterative processes described to find 

1. V5 taking (a) as your first guess 3, (b) as your first guess 100,000. 
(Find V5 to two-decimal places in each case.) 

2. Repeat Question 1 for V7. 

Solve the equations 

3. 2 2 -22 — 5 = 0. 

4. 32 2 - 72 + 1 = 0. 

5. What happens if the quadratic equation given has complex 
solutions? Investigate the solution of 

22 — 2 + 3 = 0 

Powers of 3x3 matrices and 4x4 matrices 

5.19 The methods given for 2 x 2 matrices may easily be extended 
to deal with larger matrices. In general the ideas are the same although 
the algebra is often more complicated. 

Example 

Find P n where 

3 2 

1 4 

-2 -4 
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To find the eigenvalues, remember, that the determinant 
|P - A/| = 0, i.e. 


3 - A 2 2 

1 4 - A 1 

-2 -4 -1 - A 


= 0 


Multiplying out and simplifying we have 

A3 - 6A 2 + 11A - 6 = 0 
o (A — 1)(A - 2)(A - 3) = 0 
o A = 1, or A = 2, or A = 3. 


Consider when A = 1 and we have an eigenvector 




3x + 2y + 2z = x) x + y + z = 01 

x + 4y+ z = y\=> x + 3y + z — 0\ 

—2x — Ay — z — z) x + 2y + z = oj 


Hence y = 0 and we can take z = 1, x = — 1. 
An eigenvector is in this case 



when A = 2. Then 



3x + 2y + 2z = 2x 
x + 4y + z — 2y 
—2x — 4y — 4z = 2z 4 


x + 2y + 2z = 0 1 
=> x + 2y + z = 0 
x + 2y + 3z = 0, 


Hence z — 0 and we can take y = l, x — —2. 
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This time an eigenvector is 



when A = 3. Then 



3x + 2y + 2z = 3x| y + z — O' 

x + 4y + z = 3y \ => x+y + z = 0 

—2x — 4 y — z — 3z) \x + y + z — 0 ; 

Hence x = 0 and we can take z = 1, y = — 1. 

An eigenvector is 



To find P n we now use the idea, as before, that we can write PU — UD 
where U and D are defined as previously, so that 

P n = UD n U~ 1 


/-I 

-2 

°\ 

/I * 

0 

°\ 

= 0 

1 

- 1 

0 

2 « 

0 ).£/-! 

\ 1 

0 

1/ 

\o 

0 

3*/ 


where the columns of U are the eigenvectors corresponding in order to 
the entries in D, 

We now need to find C/*" 1 if it exists. This can be done using one of 
the methods described in Chapter 2. 

It is given by 




Hence 


/- 1 

-2 

0\/l n 0 

P" = i 0 

1 - 

i 

■ 

o 

\ 1 

0 

l/\0 0 

/-I 

—2-2 n 


O 

II 

2» 

—3” I ( — 1 

\ 1 

0 

3”/ \—1 

/-1 + 2» +l 

-2(2” + 1) 

II 

T 

- 3" 

2” + 2.3” 

\ i- 

- 3" 

2(1 - 3”) 



-2(2- + 1)\ 
2- - 3 n I 
2 + 3 n / 
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5.20 It is possible, in general, to use an iterative process for finding 
the eigenvalues and eigenvectors of a 3 x 3 matrix. 


Example 

To find the eigenvalues of 



' 3 2 2\ /—0-l\ /—01\ / 0 \ 

1 4 l}( —0-91 = ( —2-71 ^2-8 ( —0-96) 

-2 —4 —1/\ 1 / \ 2-8/ \ 1 / 


and finally 
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We hope that it is possible to recognize then the eigenvalue (actually 3) 

/ °\ 

and the eigenvector—actually I—11. This gives the largest eigenvalue. 

The smallest one can be found using the same process on the inverse 
matrix and the third using the trace. 

5,21 Certain 4x4 matrices occur quite frequently when dealing 
with probability matrices and we consider this special type (an example 
is given later, see p. 140). These are 4x4 matrices which may be 
partitioned into four 2x2 matrices to give, for example 


Q = 


o 


and so on 


j and / is | 



X') 

/E 2 

EF + F\ 

" \o 

I ) 

^ (E 3 

E 2 F + EF+F\ 

Q 3 = {o 

/ ) 

/E 4 

E 3 F + E 2 F + EF+ F\ 

e 4 =U 


^ l £n 

E*-if + E n ~ 2 F +... + . 

Q n = \o 

/ 

(E n 

(£” _1 + £”- 2 + ... + /). 

~\o 

/ 

(E n 

(/ - E n )(I - E)~ 1 F\ 

~ \o 

I ) 


1 


This follows since the top right hand entry is a geometric progression, 
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APPLICATIONS 

Apart from a number of topics mentioned in the later sections of this 
book, one of most important applications—or situations where powers 
of a matrix is required—is in the realm of probability . 


5.22 Example 1 . If a person catches a bus one day there is a prob¬ 
ability of \ that he will catch it the next day. On the other hand there 
is a probability of I that if he misses the bus one day he will also miss 
it on the second day. 

We can display this information as a matrix 


First 


day 


Second day 

Catches bus Misses bus 


Catches bus 
Misses bus 


i 

f 



= p. 


It can be shown ( Matrices 2 Ch. Ill) that P 2 will give the probabilities 
for the 3rd day, e.g. 


First day 



Third 

day 


Catches bus 

Misses bus 

Cl 

' * 

if \ 

M\ 

* A 

a ) 


This means, for example, that if the bus is caught on the first day there 
is a probability of i h that it will be caught on the third day and so on. 
We can continue this process: 


„ /O'36 

0-64\ 

„„ /0-328 

0-672X 

/0-3344 

0-6656V 

P ~ \0-32 

0-68/ 

’ P ~ \0-336 

0-664/ 

’ P ~ \0-3328 

0-6672/. 


It is clear that the two rows of our matrix are becoming closer and closer 
together and the probabilities are depending less and less on the initial 
probabilities. Consider a long time ahead when the probability of 
catching the bus is nearly the same whether it was caught or missed 
on the first day. Let this probability be p. 

A large number of days ahead 
Catches bus Misses bus 

First day ^ ( P ! ~ P ) 

M\ p 1 ~p ) 

As we have chosen a day so far ahead that these probabilities are 
almost equal, then the probabilities for the day after this must also be 
approximately the same. 
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Passing to the limit, when n oo, 



o\P + 10 - p) =P 
Op = J 

Hence 

1 ) 

when n is large. 


5.25 Example 2. Three dogs A, B and C are all fighting for the same 
bone. When A has it there is a probability of \ that he will keep it for 
5 minutes, and a probability of \ that he will lose it to B (and similarly 
for C). The other probabilities are given in the matrix 


Owner of bone 
at time t 


Owner of bone at time t + 5 

ABC 
A/ \ i \ \ 

B i I 0 

C\ i 0 I / 


If A has the bone initially, find which dog is likely to have the bone 
(a) after 10 minutes, (b) after 20 minutes, (c) eventually. 

Call the matrix 



The solution requires us to find (i) P 2 , (ii) P\ (iii) P*. 

(0 [\ i i \(\ i i\ 

^ 2 = i I 0 J f 0 

\i 0 |/\i 0 i/ 

A *\ 

H Tel 

A HI 

Hence if A has the bone initially, the first row of the matrix gives 
us that the probability of A still having the bone is §, of B having it 
A, of C having it A—so A is most likely to still have the bone. 
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(ii) Now 

P 4 = P 2 . P 2 


/ f 

-h 

A\ 

/ f 

-A 

A\ 

= J A 

10 

16 

* 

[A 


* 

\* 

ie 

w 

\* 

A 

H/ 

=jj 
16 2 ( 

(86 

85 

85\ 



85 

126 

45 



$5 

45 

126/ 




/0-3359 0-3321 0-3321X 
or P 4 ^ (0-3321 0-4922 0 1758) 

\0-3321 0-1758 0-4922/ 

It is becoming clear that as we continue (as in the previous example) 
the entries in our matrix tend to certain limits. Here, as n —> oo, 

(i i l\ /i i l\ 

P n = £ i £ not £ £ £ 

\£ £ £/ \3 6 2 / 

as it would appear! 

Find P n for yourself, using the idea that 
P" = UD n U~ 1 

The eigenvalues are £, f and 1. Hence after 20 minutes (given that 
A had the bone initially), A is still the most likely dog to have it, and 
eventually they each have an equal probability of having it (although it 
appears that A’s chances are always slightly the better). 

5.23 Example 3. Two criminals A and B have just robbed a bank 
and are awaiting their getaway car. A squad car full of policemen 
arrives on the scene. The probabilities of the criminals being free for a 
period of 1 minute are shown in the following matrix, 


FREE AT TIME t 


FREE AT TIME t + 1 

A,B A B nil 



i £ i\ 
i o | \ 
oii 
001] 


i 

i 

0 
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Make sure that you understand the meaning of the entries in the matrix. 
Find 

(a) The probability of both A and B being at large 4 minutes later, and 

(b) The probability that both A and B will be caught eventually. 

(c) The probability that after 5 minutes A escapes. 

We first partition the matrix as follows, 



1 

i 

i 

1 ' 

i 

1 

0 


i 

0 


i 

0 

0 

0 

i > 



We now find P n . 

The eigenvalues of P are found to be 0, f, §, with corresponding 
eigenvectors 


"f s i 

i i i 
i t — I 


= U 


Since 


Pn = UDnU- 1 




/-§ I 

i\/° 

0 

0 

= ( I i 

t 0 

nr 

1 0 

\ i I 

-f/\o 

0 

(!)’ 

/o §(!)» 

Km 

J- 

■f § 

- o i(|)» 

id)” 


I i 

\o Kir 

—§(§)”> 

j \ 

i I 



/(§) 2 (t)” + G) 2 (t)“ M(f)» + KtXD” (§) 2 (i)" - I • i • (l) n \ 
i ■ 1(f)” + f • i ■ (!)“ G) 2 • (I)” + (f) 2 (f)» i. I •(!)*- (!) 2 • (i)» ] 
\(§) 2 (t) w - ! • Ht) n i • t • (f)» - (I) 2 (D” (§) 2 (!)" + 


Now 


pn (/_p»)(/_i>)-l.m 
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(i-py 1 


(/-me 


/H 

A 

«\ 

(A 

10 

A 


-A 

26 / 
To/ 



/i\ 

and 6= 


Now as « -*• oo, 


so that 


/O 0 0\ 
P »->(0 0 OJ 
\0 0 0 / 


Lim A” — 

n—*oo 


( 000 
0 0 0 
0 0 0 
0 0 0 



i.e. the criminals are caught! 

The probability of A and B being at large 4 minutes later is given by 
the top left hand entry in A 4 , i.e. 

(f) 2 (f) 4 + (Mf) 4 = Aft m 0-1428. 

The probability that A is at large after 5 minutes is given by the 
middle entry of the top row of P n in the case when n = 5, i.e. 

i(f)(f) 5 + i(f)(f) 5 


Exercise 5.3 

1. The President of the United States informs A of his decision whether 
or not to run again in the coming election. A then passes the information 
to B, B to C and so on. At any stage there is a probability p that the 
announcement will be changed. We thus have a probability matrix 

B 

Yes No 

Yes/l-p p \ 

No \ p 1 -p) 

for the transmission of the announcement. 

Evaluate the list as n-> oo for (i) p = 0*1, (ii) a general value p. 
What do you notice about the resulting matrix in part (ii) ? 

2. The storage of a digit in a computer consists of passing it through 
a number of stages inside the machine. At each stage there is a prob¬ 
ability p that the machine will accidentally change the digit, and a 
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probability q of the machine breaking down. Once the machine has 
broken down it will remain so. Set up a transition matrix (using Right, 
Wrong and Broken Down as stages) and evaluate 

(i) If p = 01, <7 = 0-1 the probability of correct storage in 3 

stages. 

(ii) What happens as n oo if p = 0, q ^ 0. 

(iii) What happens as n -► oo if p ^ 0, q — 0. 

For p = 01, q = 01, find the probability that the digit is stored (a) at 
all, (b) correctly in five stages. 

3. If a man earns a high income, there is an SO % chance that his son 
will also earn a high income; there are 10 % chances that he will earn a 
middle or low income respectively. If he earns a middle income the 
high: middle: low ratio is 20:60:20 %, and for a low income the figures 
are 25:25:50%. Set up the transition matrix and find the probability 
that the grandson of a low-income man is a high-income man, and the 
probability that the great grandson of a high-income man is a low- 
income man. (n.b. Do not attempt to evaluate eigenvalues: square and 
cube the transition matrix as it stands.) 

4. Assume that in Question 3 the probability of a man having a son 
is 0*8: construct the new transition matrix and answer the same ques¬ 
tions as in Question 3. 

5. The theory of genetics as developed by Gregor Mendel postulates 
that, for any given characteristic, an individual possesses two factors 
known as genes which determine that characteristic. Each parent passes 
on one gene to an offspring: the genes of the offspring are independent, 
and if a parent has two different genes, the one to be transmitted is chosen 
at random. Each gene may be of two kinds, dominant (denoted by a 
capital letter) and recessive (denoted by a small one): a person with 
either two or one dominant genes will show the dominant characteristic, 
a person with two recessives will show the recessive characteristic. 
An example is that of height of pea plants: we consider there are two 
types: tall, dominant (T) and dwarf, recessive (t). 

The following matrix shows the probability that a pair of offspring 
selected at random from those of any one of the pairs of parents on 
the left exhibit the characteristics shown at the top. 


PARENTS 


OFFSPRING 



TT Tt 

Tt tt 

TT tt 

Tt Tt 

TT TT 

tttt 

TT 

Tt 

i 

£ 

0 

0 

i 

i 

0 \ 

Tt 

tt 


0 

£ 

0 

i 

0 

i 

TT 

tt 


0 

0 

0 

1 

0 

0 

Tt 

Tt 


i 

i 

i 

8 

i 

A 

~lS 

TT 

TT 

i 

0 

0 

0 

0 

1 

0 

tt 

tt 

\ 

0 

0 

0 

0 

0 

1 / 
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By taking the nth power of this matrix we can thus find the probable 
characteristics of the nth generation. 

Find the probability: 

(i) That the initial mating of two hybrids (Tt, Tt) will eventually 
produce a predominant strain (TT, TT). 

(ii) Of ending with a pure recessive strain (tt, tt) if we do not know 
the original parental types. 

Explain why the probability of eventually reaching a pure dominant 
strain from (TT, tt) is the same as that from (Tt, Tt). 

6. In Utopia there are three types of weather and on any particular 
day the weather belongs to just one of these: 1, sunny; 2, rainy; 3, cloudy 
but dry. It has been observed that if the weather on a certain day is of 
type i then that on the following day is of type j with probability P#, 
where P tj is the entry in the ith row andyth column of the array below: 


0-3 

0-3 

0-4 

0-2 

0-4 

0-4 

01 

0-3 

0-6 


If a = (ai, 02 , 03 ) describes the probabilities of the 3 types of weather 
on a given day, and b = (bi 9 £ 2 , £ 3 ) the consequent corresponding 
probabilities for the next day, 

3 

(i) prove that bj = 2 

i = 1 

(ii) show that there is an a such that b = a (i.e. there is a steady 
state in which the statistical properties of the weather do no change 
with time); 

(iii) given that the steady state obtains, find the probabilities qy 

that, if the weather on a particular day is of type i, then that on the 
preceding day is of type j. (CS) 



Karnaugh 


6 

Maps and Associated 
Matrices 


6.1 Consider a Venn diagram illustrating two sets, x and y. 



The total area can be divided into four pieces corresponding to the 
four regions A, B, C and D. These regions are given by* 

A is x'y' B is y'x 
C is xy D is x'y 

Clearly the whole region is given by 

x'y' + xy' + x’y + xy 


These four functions 

♦ », 

xy, xy', x'y and x'y' 

are known as minimal polynomials (in the case where we are considering 
two sets). 


6.2 Secondly, we can consider a Venn diagram illustrating three sets. 
* We use xy for x rxy, x + .y for x vj .y. 
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This time the area can be divided into eight regions A, B, C, D , E, 
F 9 G and H where A is x'y'zB is xy'z', C is xy'z , D is x'y'z , is is xyz', 
F is G is and H is x'yz'. 

This time the whole region is made up of 

x'yz’ + xy'z' + x'yz' + x'y'z + xy'z + x'yz + xyz 

Each of these eight functions is a minimum polynomial (in the case 
where we are considering three sets). 

ORDER 

The order of the elements in the minimal polynomial is important and 
found as follows: 

For two sets x and y the corresponding J 2 * matrix is 

/0 1 0 1\ 

VO 0 1 1/ 

This can be thought of as a truth table (or table of closure) for x and y 
if x is the first row and y the second, i.e. 

x 0 1 0 1 

y 0 0 11 

and then considering the columns as the combinations of x and y. 
For each column we write dowfc the minimal polynomial combination 
which is true, i.e. 

x 0 10 1 

y 0 0 11 

t t t t 

x'y' xy' x'y xy 

and the order of the functions is shown. 

* i.e. the matrix with columns in the order of binary numbers reading upwards 
(see p. 161). 
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Similarly for the three sets x, y and z, the Jz matrix is 

x/0 1 0 1 0 1 0 1\ 

yl 0 0 1 1 0 0 1 l) 

z\ 0 0 0 0 1 1 1 1/ 

t f t t f t t t 

x'y'z' xy'z' x'yz' xyz' x'y'z xy'z x'yz xyz 

and again we have the order of the functions for the minimal poly¬ 
nomial. 

This idea naturally extends to 4 or more sets. 

6.3 This idea can be extended to the case in which four sets are 
considered. 



Here there are 16 minimal polynomials of the type xyzw and the 
other possible combinations. 

6.4 The importance of these polynomials is the way in which they 
can be used to simplify switching circuits. We have met them before 
when we were given the properties of a switching circuit and were 
required to find the Boolean function representing the circuit.* 

The Venn diagrams which we have drawn are not very useful in 
their unsystematic outline. To help us in simplifying circuits, a modified 
version of diagram has been developed by Karnaugh. 

KARNAUGH MAPS (sometimes called Veitch Diagrams) 

6.5 These are rectangular or square representations, rather than the 
oval or irregular shapes that we have used before. 

It is not worth using these maps for less than three variables in 
practice. Here is a map for two variables. 

* For example, see Sets and Logic 2 y Chapter 2. 
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An entry 1 is inserted in the diagram to represent the function to be 
simplified. The 1 in the top right represents the function xy ' and the 
other squares represent the three other minimal polynomials of two 
variables. For three variables we have 


y 


1 1 

1 xyz’ 

xyz 

x y'z 

1 

x y'z' 

x'y z r 

x’y z 

x'y'z 

1 

x'y'z' 


z 


Here the entries represent xyz* + x’y'z* + xy’z*. 


y 


1 

1 








z 


This represents xy (or xyz ' + xyz). 


y 



1 

1 



1 

1 



z. 


This represents z. 


6.6 These maps for three variables are used later on in the matrices 
of the Karnaugh map (see 6.8). We now return to the case of four 
variables to illustrate how the maps may be used to simplify functions. 


y 









i 





1 
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The function illustrated above is wx'y'z'. When the function con¬ 
tains all four of its constituents then there is only one entry in the table. 
When only 3 or 2 or 1 of the constituents is used we have maps as below. 


y 









• 1 




1 






z 


The function xyz'. 


y 



1 

1 



1 

1 

1 










z 


The function zw\ 


y 






1 

1 

1 

1 

1 ■ 

1 

1 

' 1 






z 


The function x. 


Simplification 

6.7 The digit 1 is placed in each cell required to represent the given 
function. These digits are then grouped as follows. Marked cells next 
to each other are grouped into rectangles, squares, rows, columns and 
pairs allowing ‘end-over’ adjacencies in both directions until each 
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marked cell is grouped if possible. [Note that you can treat the four 
corner cells as forming a ‘square’ because of ‘end-over’ adjacencies.] 


Examples of grouping 
Example 1 


y 


lLI 

1 


f 1 1 

:_i__ 

i : 
__ / 


! i ; 




! i ! 




i i j 

LlJ 


z 


Example 2 

_ y 







::o 



rr 




cr_ 

~y_) 


i 


Cl I 

“ ~r 


Clj 

i 

1 







z 


Example 3 


* -i 

-1 

1 \ 


C" 




Ll_ 

_ ij 





i i 

-,j 


rr 

7"| -T" 

i _L j 

T- 


Examples of simplification 
Example 1 


y 







n~j 




LjJ 

l l 

\ 

1 1 



I 

_ 1 _ 

1 

- J J 


z 


f = xyw + x'yz'w + x'yzw + xz/ 
=*yw + xzy' 
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Example 2 



f — xy’ + xz' + xyzw + x'y'z' + y'z'W + yz ' + zV 
= z' + wx + xy'w' (or z' + xy' + vvx by different grouping) 


Example 3 



f == x’yW + ywz' + xzW + x'y'z'w' + xy'zw' + x'yzw' 

= z'w + xzw' + x'yW + x'z' 

These are not the only possible groupings in each case. The ‘better’ 
the grouping, the simpler will be the final function. 

6.8 In some cases it may be much easier to group the empty cells 
together and hence find the complement of the function. The function 
can then be found using De Morgan’s laws. 


Example 4 
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Here 
so that 


f' = xz+ yzw' 

/ = (*' + z') ,(y' + z' + w) 


y 







TS 

"_~Lj 


;'l 



l 1 



1 "■ 
j 1 



I 

i ; 



! i 



f '--t 



to 






V. 


The alternative grouping is given above. 

/= z' + x'w + x'y'w' 
The two switching circuits involved are 



and 


-z'- 

—<- X? -W--- 

Naturally the first, with the least switches, is the ‘better.’ 

There is a better grouping still (z' + x' (w + jO) 

Exercise 6.1 

Use Karnaugh maps to simplify the following functions: 

1. /= wxy'z' + yzw' + wy' + wxy. 

2. f = wz' + xz' + x'yz' + xyzw + wy' + w’x'y’ + w'x'z'. 

3. / = x'y'w + xyz'w' + xyz' + x'ywz + xywz' + xyz. 

4. / = x'yzw + xyz'w' + y'zw' + wxz' + w'x'y' + w'y'z. 

5. /= x'y'w' + x'y'z + x'y'w + zy'w' + xyz’w’ + xy’z'w'. 

6. f = xyzw + xy’z’w + x’y'zw + x'y’z’w' + x’yzw + xy'z’w’. 

It sometimes occurs that we do not mind what the value of one of 
the four functions may be. This situation can be plotted on the map 
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and we can see the function as best we can in our grouping procedure. 
Suppose we have for example, 



z 


with y unrestricted. 

We have entered a U in the map to show that the entry can be either 
1 or 0,* as is most convenient for us in the simplification process. 

7. Simplify the following and give the function. 


y 



z 


Here w is unrestricted. 



z 


Here the unrestricted entries occur due to certain input conditions on 
the circuit. 

Give the simplified function for the above map, 

(a) by grouping to find the function, and 

(b) by grouping to find the complementary function, and hence the 
function using De Morgan’s laws. 


* All the U's must be the same, either all 1 or all 0. 
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These maps are of little use except in the simplification of functions 
of four variables. 


MATRICES ASSOCIATED WITH KARNAUGH MAPS 

6.8 A Boolean matrix may be defined as a matrix which contains 
Boolean entries, i.e. 1 and 0. We shall see that such a matrix can be 
arrived at in a variety of ways. Combinations of these matrices, too, 
can be achieved in different ways. Here we consider the matrices 
associated with the Karnaugh maps. 

6.9 Consider two switching circuits of three variables. 


^- z 7 - 



The Boolean functions which represent these two circuits are 
fi = xy'z + yz 

and fi = xz' + x'y + z. 

These two can be written in the form of minimal polynomials by the 
use of Karnaugh maps as shown below. 


y 


0 . 

1 

1 

0 

0 

1 

0 

0 


z 


The function fa 


y 


■ 

1 

1 

1 

1 

1 

l 

0 


z 


The function fa 
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fi can then be expressed as the matrix 


and fz as 


/O 1 1 0\ 

lo 1 0 0/ 

/i 1 1 n 

U i i o) 


6.10 When the two circuits are placed in series the overall transmission 
function is given by 

where/ 1/2 is found by multiplying together the corresponding elements 
of /1 and fz. Here 


i.e. 


/ 0.1 1.1 1.1 

•' 1 --' 2 ~\ 0.1 1.1 0.1 

f = (° 1 1 °) 

J \0 1 0 0 / 


0.1\ 

0 . 0 / 


where we are using the laws of Boolean algebra, where 0.0 = 0; 
0.1 = 1.0 = 0; 1.1 = 1. On the other hand, if the two circuits 
were placed in parallel, the overall transmission function would be 
given by 

S ~h +/* 

where the elements ofg are found by adding together the corresponding 
elements of fi and fz, i.e. 

1 + 1 
1 + 1 


+ 1 
+ 1 


i.e. 


fi +/ 2 = 

A 1 

1 


1 + 1 
0 + 1 


+ 1 ) 

+ 0 / 


1 ! V 

1 0/ 


The relevant laws of Boolean algebra here are 

0 + 0 = 0; 0 + 1 = 1 + 0 = 1; 1 + 1 = 1. 

6.11 It is possible to combine circuits containing functions of a 
different number of variables by expanding the Karnaugh map to the 
size required for the circuit containing the most variables. For example, 
consider the two circuits given below: 


/1 = xy + x' 


• if —, 


~X - 

-y ■ 


-y- 

-z 7 - 


f 2 = xyz' + x/ + y/ 
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Here the first circuit is a function of two variables (x and y) and the 
second is a function of three variables. The two associated maps are 


and 


In order to combine these two, the first map is expanded to three 
variables to give: 


A 


and the two transmission matrices are now 

/I 1 0 0\ /I 0 1 1\ 

\i i i lj and \i o o o) 

which can now be dealt with as required. 

6.12 The same procedure can be carried out when the circuits 
contain, for instance, functions of three variables and functions of four 
variables. 

Example 

Suppose we wished to combine our second switching circuit with 
another containing four variables. The Karnaugh map expanded would 
be 


y 


1 

1 

0 

0 

1 

1. 

1 

1 


z 


y 


1 

0 

1 

1 


y 


1 

0 

1 

1 

1 

_ 

0 

0 

0 


z 


y 


• 

•0 

1 

1 

■ 

0 

1 

1 

1 

0 

0 

0 

1 

0 

0 

0 


z 


f% = xyz' + xy' + yz ' 







Karnaugh Maps 


157 


Similarly if our first circuit were to be combined with another of four 
variables we should have: 


y 


1 

1 

0 

0 

1 

1 

0 

0 

1 

1 

1 

1 

1 

1 

1 

1 


fi = xy + x'. 


The matrix representation here would be 




6.13 We now attempt to demonstrate that multiplication and addition 
of these matrices as defined does give the required transmission matrix. 
Consider the two switching circuits in 6.11, put in series, i.e. 


i—* — y—^ i 





fi = xy + x’ fz = xyz' + xy' + yz f . 

We already know* that when circuits are taken in series the resulting 
function is / 1 / 2 . We can then consider / 1/2 and see if this gives the 
required matrix of minimal polynomials. 

/ 1/2 = (xy + x')(xyz' + xy' + yz') 

= (xy + x')(yz' + xy') 

= xyz' + x'yz'. 


A 


The map of/ 1 / 2 . 




1 

0 

0 

0 

1 

0 

0 

0 


z 


* cf. Sets and Logic 2. 
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/I 0 0 0\ 
\1 0 0 0 / 


The transmission matrix / 1 / 2 . 


However, using the defined law of multiplication we have 



1 1 0 0W1 0 1 1\ 

1 1 1 1 / • \1 0 0 0 / 

1 ° ° 0\ , f 

1 0 0 o] asbefore - 


This is not a proof of the method, but an illustration that this method 
does give us the desired result. 


6.14 As a second illustration consider these two switching circuits 
now in parallel, i.e. 



fi = xy + x' 

fz = xyz' + xy' + yz'. 

We have seen before* that circuits in parallel are combined by addition 
to give the function representing the circuit. Here 

fi + fz = xy + x' + xyz' + xy' + yz' 

= xy + x' + yz' + xy' 

= x + x'+yz' 

= 1 . 

The map of fi + fz is thus 


y 


1 

f 

1 

1 

1 

1 

1 

1 


* Sets and Logic 2. 
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with matrix 

/i i 1 n 

\i i i 1/ 

On the other hand, combining the two matrices for f\ and fz gives: 

/I 1 0 0\ /I 0 1 1\ _ /I 1 1 1\ 

\l 1 1 1/ + \l 0 0 0/ \1 1 1 1/ 

as before. Note that this circuit is always ‘on’. 

Notice that matrices of this type are commutative since Boolean 
Algebra elements are commutative under both addition and multiplica¬ 
tion. 


Exercise 6.2 


L HZrCZH 


I —x — y—z ?—I 


S2 


Find the transmission matrices of 
(a) S U (P) S 2 , (c) SiS 2 , id) Si + S 2 . 


Draw simplified circuits for (c) and (d), 

2. Find the transmission matrices of 

(a) p, (b) q, (c) r, (d) pq,(e)p + r,(f)p + q + r, (g) pqr. 



h-cn-r- 
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3. Find the transmission matrices of 



h t2 

ip) tl> (b) (c) tit2 9 id) t\ + t2» 

Draw a simplified circuit for (c). 

4. Find the transmission matrices of 

(a) uu ib) u 2 , ic) mi + m 2 , id) mim 2 . 



U2 



7 

Boolean Matrices 


7.1 As we have already mentioned, matrices with Boolean entries 
can be formed in a number of ways. Consider the two functions 

h = xy 
/2 = xy' + yx' 

* 

These can be expressed in terms of their minimal polynomials as 

fi = 0 . x'y' + 0 . x'y + 0. xy' + 1 . xy 
/ 2 = 0. x'y' + 1. x'y + 1. xy' + 0. xy 

and these equations themselves may be represented in matrix form as 

o-c:: so 

This can be written 

f = Az 

where A is the matrix of the coefficients and f and z are both vectors. 


J matrices 

7.2 These matrices are the Boolean matrices having their entries in 
the order of binary numbers (reading from the bottom upwards). 
For instance, the first three J matrices are 


Ji = (0 1) 


H 


(°o 

t 

00 


ion 

0 1 1 / 

t t f 

10 | 
li 
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/O 1 0 1 0 1 0 1\ 

/ 3 =(0 0 1 1 0 0 1 l ) 

\0 0 0 0 1 1 1 1 / 

t t t t t t t t 

I 001 I Oil I 101 I 111 
000 010 100 110 

Consider now any column matrix or vector z. Each vector z will be 
equal to one and only one of the columns in the corresponding / matrix. 
For example, the vector 



is equal to the third column of J 2 ’, whilst the vector 



is equal to the seventh column of J%. 

7.3 We now define a special law for Boolean matrix multiplication. 
Suppose we have 

A.B = C 


then the columns of C are obtained by comparing the columns of B 
with the corresponding / matrix—suppose the column of B is the second 
in the J matrix then the column of C chosen is the second in A. 

For example to find 

/I 0 1 0W0 0 1 0\ 

\1 1 0 lj ’ \1 0 1 1/ 

A B 


The corresponding J matrix is 

(0 1 0 1\ 

lo 0 1 1/ 


i.e. J 2 . 


0\ . 


The first column of B ^ j is the third column in/ 2 , so the first column 
of C is the third column of A, i.e. Similarly the second column 
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of B is which is the first column of h\ hence the second column of 

C is the first column of A, i.e. and so on. Here 


n o 1 o\ /o o l o\ /i i o i\ 

\1 1 0 1 / \1 0 1 1 / \0 1 1 0 / 


7.4 It is interesting to see whether matrices of this type commute 
under multiplication. 

(0 0 1 0\ /I 0 1 0\ /0 1 0 1\ 

\1 0 I 1/ ' \1 1 0 1/ \1 1 0 \J 

So that 

A.B^B.A 


(Note that AJ — A — JA.) 

Boolean matrices which have two or more of their columns equal 
are said to be singular. The inverse of a Boolean matrix can only be 
found (in fact only exists) when the matrix is non-singular. 


7.5 Since the J matrices are the identity matrices we can find the 
inverse of a matrix by reversing the process of multiplication 

A . A- 1 = J 


To find the inverse of 



i.e. so that 


/I 1 0 0 

\o 1 0 1 



I 0 0\ 

1 0 l) 

)■( 


0 1 0 
0 0 1 


i) 


the first column of A~ l is found as follows. The first column of J is 

the third column of A —so the first column of A~ x is j j), the 3rd column 
of/. V1/ 

The second column of A~* is found in the same way. The second 

column of J is first column of A so the second column of A~ x is , 

the first column of J, and so on. So 


A- 1 


0 0 I 1\ 

,10 10/ 
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/O 0 1 1\ /I 1 0 0\ = /O 1 0 1\ 

\1 0 1 0 / # \0 1 0 1 / \0 0 1 l ) 


so that 


A- 1 .A — A . A -1 = J 


Solution of Boolean matrix equations 

7.6 Consider the equation 

f = Az 

where A and z are known and we wish to find f. 
For example, suppose 


o-c:: do 

and we wish to find The process is carried out as follows 


: z is the 


second column of Jz so that f is the second column of A, i.i 

As a second example consider 

@ /0 1 1 0 0 1 0 1 \ / 1 \ 

= 10 10 110 1.1 

\0 0 0 0 1 1 1 0/ \o/ 

The vector 


is the fourth column of Jz, so 

//A _ M 

I/"/ V 

the fourth column of the matrix above. 

7.7 Eigenvectors occur sometimes, i.e. when 

Az = z 

For example ^ is an eigenvector of 

/I 0 0 1\ 

\0 1 0 1/ 
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since 


10 0 1 
0 10 1 



It is possible to have more than one eigenvector of a matrix. For 
example, consider 


/0 1 0 1\ 
\1 0 0 1 / 


This has two eigenvectors 


and 


since 


c:: i)o-o - c:: so-o 

How many eigenvectors has /„? 


Exercise 7.1 


1 0 0 \ 

1 0 1 } 

1 0 0 / 

3. Are Boolean matrices associative under multiplication of its type? 
Check using the three matrices 

/I 0 0 1W0 0 0 1W1 0 1 1\ 

\1 1 0 0/’\l 1 1 l/’\0 1 1 0/ 

4. Find the inverse matrix of 

5. Find the inverse matrix of 

/0 1 0 1 1 0 1 0\ 

(1 1 0 1 0 0 0 1 

\0 0 1 1 1 0 0 1/ 


1. Multiply the following Boolean matrices: 

(a) (l 1 0 1\ /0 1 0 1\ 

\o i o o/ * \i i o i; 

(b) /0 1 0 1W1 1 0 1\ 

\1 1 0 1 / \0 1 0 0 / 

2. Find the product 

/0 1 1 0 0 1 1 0\ /0 1 0 1 0 

(l 1 1 1 0 0 1 O] • (0 0 1 0 1 

\0 0 1 0 1 0 0 0/ \l 0 1 10 
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U6 


6. Find the eigenvectors of 
(a) /0 0 1 1\ 

\0 1 0 o) 

(c) /0 1 0 l) 


/0 10 1\ 
\1 1 0 \) 


(« /0 1 0 1\ 
U 0 1 1/ 

(d) (0 1 0 1\ 
\0 0 1 1/ 


(e) /0 1 10 1 10 l\(/)/l 010100 0\ 

(10 1110 0 1 ) 

\0 0 0 1 0 1 0 1 / 


0 11110 11 


VO 0 0 1 1 0 1 0/ 


7. Solve the following equations 
(a) 

(&) 

8. Solve the equations 

(«) //i 


CM 

l 0 1 0\ /1\ 
3 1 1 0/ ' \0/ 

CM 

1 1 0 0\ /1\ 
i o i irw 



1 0 0 0 1 1 1 0 ' 
0 10 110 10 


0 0 110 110, 

10 0 10 10 1 
0 10 11111 
1 0 1 0 0 0 1 1 



Output vectors from switching circuits 

7.8 The output from a circuit across a pair of terminals, for given 
input conditions can be found as a vector. 

Consider the circuit 



Boolean Matrices 


167 


This gives 

/12 = xy' + x'y 
/is = x' + xy'y = x' 
fza=y + x'xy' = y 

We can now find each of these outputs in terms of the minimal 
polynomials, by the use of Karnaugh maps. 


/12 



These can be expressed as: 

/12 = 0 x'y' + 1 xy' + 1 x'y + Oxy 

/is = 1 x'y' + 0 xy' + 1 x'y + 0 xy 

/ 23 = 0 x'y' + 0 xy' + 1 x'y + 1 xy 


7.9 This enables us to find the output vector from a circuit if we are 
given the input vector. 

In Boolean matrix form we have 

'0 1 1 0\ 

10 10 - 

fi 0 1 1 / 




As the matrix 


Jz, i.e. 


consists of two variables the relevant J matrix is 

/0 1 0 1 \ 

\0 0 1 1 / 


To solve: Suppose the input vector is ; this is equal to the first 

column in our J matrix, so that the output will be the first column of 
our matrix, i.e. 
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Similarly, if we consider an input vector of ^ this is equal to the 
third column of our J matrix and hence the output vector is 


7.10 Consider as a second example the circuit below: 




/12 = xy + z(y' + z') + (pc' + z)(yz)(y’ + z') 
fi 3 = z + xy(y' + z') + (*' + z)yz 
fu = x' + z + xy(y' + z')(yz) + z(yz) 
fzz = / + z' + xyz + xy(x' + z)(yz) 

fu = xy(x' + z) + (y' + z')yz + (y' + z')z(x' + z) + xyzyz 
fzi =yz + z(x' + z) + (/ + z')xy(x' + z) 

These output functions can, first of all, be simplified using the laws of 
Boolean algebra, to give: 

/12 = xy + zy' 
fiz = z + xyz' 
fli = x' + z 
fzz = y + z' + xyz 
/ 24 = xyz + y'z 
fzi = z 

7.11 The next step is to find the Karnaugh map of each function 
and eventually to write the functions in terms of the minimal poly¬ 
nomials. The Karnaugh maps of /13 and / 2 3 are: 


- 

1 

1 

1 

1 

0 

1 

1 


1 

1 

1 

0 

0 

1 

1 

0 
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These give us, in terms of the minimal polynomial 

/13 = 0 x'y'z' + 0 xy'z' + 0 x'yz' + 1 xyz' + 1 x'y'z 

+ Ixy'z + 1 x'yz + 1 xyz 

/23 == 1 x'y'z' + Ixy'z' + 1 x'yz' + 1 xyz' + 1 x'y'z 

+ Ixy'z + 0 x'yz + 1 xyz 

The whole set of six equations can thus be written in matrix form as: 

( /i2\ /0 0 0 1 1 1 0 1\ 

fis \ /O 0 0 1 1 1 1 l\ 

/l4 1 / 1 0 1 0 1 1 1 1 1 

/23 I “ I 1 1 1 1 1 1 0 1 J 

f 2 4/ \0 0 0 0 1 1 0 1/ 

fj \o 0 0 0 1 1 1 1/ 

As previously, we can solve these equations for any given input vector 




The appropriate J matrix is Js , i.e. 

/0 1 0 1 0 1 0 1 \ 

0 0 1 1 0 0 1 1 ) 

\0 0 0 0 1 1 1 1 / 

Suppose the input is 



then the output will be the fourth column of our matrix, i.e. 



7.12 Our next problem is to investigate the processes described, 
i.e. the multiplication of these matrices and the problem of finding the 
output from a circuit with given input. Consider again, the example 
on p. 164, 
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and return to the original equations, i.e. 

/i = lx'/ + Ox/ + 1 x'y + 0 xy 
/a = lx'/ + lx/ + Ox'y + lxy 

The vector tells us that x = 1, y = 0, i.e. the only non-zero 

element in the minimal polynomial is the one containing xy' —corres¬ 
ponding to the second column. Alternatively, if 

/i = x'y' + x'y and x = 1, y = 0 

we have 

/i = 0 

similarly when 

/ 2 = x'y' + xy' + xy and x = 1, y = 0 

we have 

/2=1 



the second column of the matrix as before. 


7.13 Multiplication of these matrices is carried out likewise; by 
considering, for example 


/I 0 1 0\ /I 1 0 0\ 

\l 1 0 1 / * \0 1 0 1 / 


as being constructed from 


first 

column 



0 

1 



second 

column 


c 


o 1 
1 o 



G 


0 

1 


third 

column 



fourth 

column 


(! 


0 1 0 \ / 0 \ 

1 0 lj’llj 


Circuits involving delay units 

7.14 Suppose that Si, S 2 ,. . . S» represent the states of n delay 
units at any specified instant of time. Then the outputs /i,/ 2 , • • •/» 
represent the states of these units 1 instant later. 
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The diagram below shows how a simple delay circuit is built up.* 



Where -£-(7)-— 


x.y 

and x + y 



represents the Inverter 
and Gate 

or Gate 


In this case 



1 0 0 0 \ 
.0 111 / 



(the actual equations are f\ = S\'S 2 ,fz = Si + S 1 S 2 ). 


7.15 Suppose that at time instant zero, Si and S 2 are both zero, i.e. 

?? >) • (0) 

so 

0-0 

This output is then fed into the circuit again as an input to give a new 
output, i.e. 

* This is identical to the usual method of making gated circuits~*see Sets and 
Logic 2. 
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0-0 

and this is fed in again to give 



and so on. 

A A 

o o 

1 0 

0 1 

0 1 



Time 

instant 

0 

1 

2 

3 


Here the output clearly stabilizes as fi = 0,fz = 1 after 2 time instants. 

If the initial conditions on Si and Sz were different we still achieve 
stabilization quickly, i.e. 

If Si = 0, Sz — 1, /i = 0, / 2 = I 

at once, and similarly when 

Si = 1, 52 = 0 or Si — 1, Sz = 1 


No matter what the initial states of the delay units are, the outputs 
lock on to fi = 0,/2 = 1 after, at most two instants of time. 

Exercise 7.2 

Draw the delay circuits represented by the following equations. 
Investigate the output cycles. 

1. A = V 2 + 

k = *2- 

2. / 1 = 5 2 + 5' 

fz = ^(^2 + ^ 3 ) 

k = sis 3 . 

3. /j = S^.yS'z + 5,5.2 
fz = 5 2 (525 3 + 5,) 

k = S[. 

4 . fi = SiS'z + S'iSz 

k = 5;. 
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Notice that a locking state occurs whenever the vector 
eigenvector of the equation. In the example given 


cm: :: so 



is an 


i.e. 



is an eigenvector of the equation. What happens if we consider 


every other time instant? Clearly it makes no difference if the system 
has a locked output. 



8 

Matrices and Networks 


8.1 Matrices provide an easy method of describing networks. The 
following methods are applicable when a computer can be used. The 
instructions are simplified to give the ideas. 

8.2 Consider the following network of roads linking two towns in 
county A , three towns in county B and two towns in county C. 



We can describe the roads connecting towns in counties A and B as 
follows: 


1 

bi 

bz 

bz 

ai 

1 

i 

0 

02 

0 

i 

1 


In usual matrix notation we may write 



The matrix describing the connections between the towns in county B 
and in county C is 

I Cl C 2 


h 1 1 

bz 1 1 

bz 0 1 
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or in usual notation 



N6W consider a combination of P and Q (we call it P . Q) and see 
what logical interpretation we might give to the resulting matrix. 



This final matrix must tell us something about the roads from the 
towns in county A to the towns in county C. Consider 



Cl 

C2 

ai 

2 

2 

a2 

1 

2 


This tells us that there are two routes from a± to a, two routes from 
ai to C 2 (via bi or b%) and so on. This is the information given by the 
product matrix. 

This method can be extended indefinitely to give us the number of 
routes from any number of towns to any other numbers of towns. 

Some problems 

8.3 In this representation, what would be the identity matrix? An 
inverse matrix? Clearly matrices with negative entries can have no 
physical meaning in the situation we are describing. The idea of an 
inverse matrix can be given a meaning. The matrix P represents the 
routes from towns in county A to towns in county B; the ‘inverse’ 
matrix will describe the routes from town in county B to towns in 
county A , i.e. here 



so that the inverse of P 



or the inverse of P 

P ~ 1 = P' (the transpose of P). 
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8.4 What happens if we multiply P by P' ? What information is obtained 
from the product? 

Consider again the original network 

bx 

c \ 


c 2 

The numbers inserted represent the distance in miles between the towns. 

We can now make up a distance matrix, where each entry represents 
the distance between the towns. Here 



bi 

b 2 



Cl 

C2 

rH 

Q 

II 

6 

4 

— 

Q= b i 

7 

i 

0,2 

- 

5 

3 

b% 

8 

4 





bz 

- 

4 


The - shows that the towns are not directly linked. Here the - would 
be replaced by oo if a computer were used. In practice a very large 
number, say 10 6 would probably be used. 

8.5 How long are the shortest routes between towns in county A 
and towns in county C? We can work out the entries by trial and error. 
They are: 



Cl 

C2 

ai 

12 

8 

a 2 

13 

7 


The problem is—how do we combine P and Q to obtain R . The solution 
is easier to obtain than to explain. Consider matrix multiplication of 
P and Q . We should consider first 



(6 4 -) IT 
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In this case, instead of multiplying corresponding entries and adding, 
we add corresponding entries and write down the minimum, 

e.g. R is 

/min of {6 + 7, 4 + 8, - + -} min of {6 + 7, 4 + 4, - + 4}\ 
\min of {- + 7, 5 + 8, 3 + -} min of {- + 7, 4 + 5, 3 + 4}/ 
or 



We call this operation min, hence R = P min Q. Note that the dashes 
rule out the entries where they occur. 

8.6 Notice that similar situations arise where the numbers on each 
road represent, for example: 

1. The average time taken for each stretch, or 

2. The capacity of drains. 

In the first example the process to find the shortest time is identical 
to that shown for finding the shortest distance. In the second example,* 
the capacity of the drain from a to b is 6 and the capacity for b to c 
is 7, so that the maximum capacity from a to c is 6. In this case our 
maximum drain matrix would be: 


ci c 2 

D = ai 6 6 

02 5 4 

Give a rule for finding D from P and Q, Investigate this rule. For 
example, is it commutative or associative? 

8.7 Consider a very simple network with distances inserted 


d 



* Drain flow in two ways? Investigate. 
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The connecting matrix with distances inserted is then 



abed 

a 

0 4 2 - 

b 

4 0 3 9 

c 

2 3 0 2 

d 

- 9 2 0 


8.8 This contains information about the one-link routes. What are 
the shortest routes between a, b , c and d if we allow one-link or two-link 
routes? 

abed 

S2 -- 

a 0 4 2 4 

b 4 0 3 5 

c 2 3 0 2 

d 4 5 2 0 

How can we find S2 from Sil In fact, Si min Si = S 2 . 

Is the operation min associative? Can we do the operation again? 
What is Si min Si min Si? 

This gives us the matrix of shortest routes when we are allowed 
one-, two- or three -link routes. Notice that, as usually happens, the 
matrix of shortest routes quickly becomes stable, and no further 
combination of min Si alters the entries. 

This type of problem is met by London Transport when working out 
the fares between various parts of London where there are a number of 
routes and changes between them. 

Networks and critical path 

8.9 Consider the following network where the numbers represent 
the times taken on various processes. The problem is to find the maxi¬ 
mum time taken and hence the critical path. (See J. Q. Thompson, 
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An Elementary Introduction to Operational Research, Chapter 1; the 
order of the letters on the following figure has been chosen to comply 
with Thompson’s notation.) 

Note that direction plays an important part in what follows. The 
distance (or time) matrix is: 



a b c d e f 

a 

0 5 6 - - 7 

b 

- 0 - - 6 - 

c 

-- 0 5 4 - 

d 

- - - 0 2 6 

e 

--- 0 3 

f 

0 


or, in more usual terminology, 



We now find M max M by a process similar to the previous one. 
We proceed as if we were going to multiply the matrices but instead 
we take first row and first column etc. add corresponding entries and 
write down the largest term. As we are finding the max, the - would 
be replaced by — oo if a computer were used (in practice —10® say). 

8.10 For instance, consider first row and fifth column, i.e. 

(0 5 6 - 7) 


Our entry is now the maximum of 

{0 + -, 5 + 6, 6 + 4, - + 2, - + 0, - + 7} 

i.e. of {-, 11, 10, -, -, -} which is 11. 

This, in fact, tells us that the longest two-link journey from a to e 
is 11. (In fact it tells us the longest one- or two-link journey is 11.) 
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Under this rule of combination 


M max M = 




This matrix gives us all the longest one- or two-link journeys. We can 
continue this process to give us the longest one-, two- or three -link 
journeys. 

M max (M max M) is 


( 0 5 6 - - 

- 0 - - 6 

- - 0 5 4 

- - - 0 2 



5 6 11 11 7s 

0 - - 6 9\ 

- 0 5 7 11 | 

- - 0 2 5 I 

--- 03 / 

- - - - 0/ 

( 0 5 6 11 

OH 


13 

6 

7 

2 

0 



Is this the same as (M max M) max Ml 


8.11 We can repeat this procedure as many times as we like. Remember 
that we are interested in the maximum time from a to d. 


M MAX (M MAX (M MAX M)) 


5 6 
0 - 
-0 



M MAX {M MAX M) 


This means that the process has now stabilized and that the longest 
time was given by the three -link journey, as the four -link journeys 
did not change the maximum times. 

The maximum time a to /is therefore 17* on a three link journey. 

* Draw a flow diagram of this procedure. 
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It is possible to discover which links are involved by considering how 
this 17 was arrived at. In this case the answer is obvious from the 
diagram and our Critical Path is from a to c to d to /. 

Exercise 8.1 

1. Basically a network consists of a number of nodes, joined by a 
number of links. Here is a typical network. The links are labelled 
1, 2, 3 . . .6. These numbers have nothing to do with lengths, times 
etc.. . . 



It is possible to represent this network in a number of different ways, 
i.e. either as a node I node matrix, a link/link matrix or a node! link 
matrix. Write down the matrices in this case and investigate their 
properties. For example consider the meaning of a node/node matrix 
multiplied by itself; a node/link matrix multiplied by (a) itself, (b) its 
transpose matrix. Could you draw the network given any of the matrices ? 

2 . 



(a) How many ways are there from the towns ai, 02 , 03 to dil How 

many from bi ,. . 65 to d\l 

(b) Suppose the numbers marked represent the distances between 
the towns. What is the shortest route from ai to dil From to dil 
Make up the shortest route matrix. 


182 


Matrices: Pure and Applied 


( c ) Suppose the numbers now represent telephone links (cf. the 
oil pipe/drain problem earlier). Draw up the maximum link matrix 
for towns in country A to town d\ in country D. 

3. Consider the following network. 



The numbers represent lengths. Write down the one link distance 
matrix and find out when this matrix stabilizes under the law min. 
Hence give the shortest distance matrix. 

4. The following network represents a process in which the times 
for the various jobs are inserted. Find the critical path from a to g 
and the maximum time taken. 



5. Look back to the network in 8.2. In this type of representation 
can we give any meaning to addition of matrices? Consider the other 
representations carefully and see if there are addition rules which would 
mean something in terms of the actual networks. 

6. Consider the following network. 



This has nodes at a 9 b 9 c 9 d and e\ links 1, 2, 3, 4, 5, 6, 7 and regions 
a, /S, y, d. 
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We can represent this network using a number of different matrices. 

(a) A node/link matrix. 



i 

2 

3 

4 

5 

6 

7 

a 

i 

1 

0 

0 

0 

0 

0 

b 

i 

0 

1 

1 

0 

0 

0 

c 

0 

0 

1 

0 

1 

1 

0 

d 

0 

0 

0 

1 

1 

0 

1 

e 

0 

1 

0 

0 

0 

1 

1 


( b ) A link-region matrix 



a 

p 

y 

<5 

1 

0 

0 

1 

1 

2 

0 

0 

1 

1 

3 

0 

1 

0 

1 

4 

0 

1 

1 

0 

5 

1 

1 

0 

0 

6 

1 

0 

0 

1 

7 

1 

0 

1 

0 


These entries, show for instance, that region a is bounded by links 
5, 6 and 7. 

(c) A node/region matrix 



a 

p 

7 

d 

a 

0 

0 

1 

1 

b 

0 

1 

1 

1 

c 

1 

1 

0 

1 

d 

1 

1 

1 

0 

e 

1 

0 

1 

1 


Work out the values of 

AA\ A'A, BB\ B'B , CC and CC 


Interpret your answer in each case. 
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7. Calculate A , B and C for the network below. 




9. Find the critical path for the following network, going from 
a to /. 



10. Find the critical path for the following network going from 
a to h. 
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11 . 



Above is an approximate map of South-West England. Use matrices 
to find the shortest route from Truro to Salisbury passing through 
each of the areas shown. The mileages are 



Barn. Ok. Ply. 

Tau. Ilm. Dorch. 

Truro 

87 66 55 Barn. 

50 66 86 


Ok. 

55 57 76 


Ply. 

74 76 94 



Sal. 

Tau. 

64 

Ilm. 

43 

Dorch. 

40 


12. Project. Find out what you can about dual configurations. 
Write down their matrices and see if you can find how (or if) they are 
connected (a book on projective geometry might help). Look up 
Bow’s notation in a mechanics book (e.g. Humphrey, Statics) and 
find out how it works. 

13. The network represents a number of shops belonging to a firm 
in a small area. The problem is to decide which of the two shops 
a or b should be used as a 
central store. We really want to c 
know if the total of the shortest 
distances from atob 9 a to c, a 
to d etc. is less than the total of 
the shortest distances from b to 
a , b to c, b to d etc. 

e 
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The one link distance matrix is 



a b c d e 

a 

0 3 3 -2 

b 

3 0-24 

c 

3-074 

d 

- 2 7 0 7 

e 

2 4 4 7 0 


Find S min S 9 and S min (S' min S). Deduce that a is the most central 
store required and that the total distance is 2 miles less than from b. 

14. Which of a or b is the most central in the network below? 
(See No. 13.) 


d 




9 

Further Situations in which Matrices 
are Used 


9.1 This chapter contains information about other situations in 
which matrices are used. The list is not comprehensive and most of 
the topics are not studied in depth. The topics considered are: 

I An example from the theory of numbers. 

II Recurrence relations. 

III Difference equations. 

IV Solution of differential equations. 

V Four-terminal networks. 

VI Affine transformations. 

VII Matrices and the theory of games. 

VIII Knots. 

IX Matrices with complex entries. 


I AN EXAMPLE FROM THE THEORY OF NUMBERS* 


9.2 


if y = 


— — with a, b, c,d,x> 0, then y lies between - and 
cx + d c 


b 

d' 



If 



* cf. Matrices 2 by G. Matthews, p. 64. 
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then 


(0 Ml = 


>0 


a b 
c d 

(ii) MI <0, and 

(iii) Ml = 0. 

Assuming that \A\ ^ 0, we can write 


y = 


ax + b 


as 


C5) 


mx 


cx + d 

or y = A % x 

This defines a new operation • 

9.3 Suppose now that x = Bm z, where 

5= (r $)’ and P,q,r,s,z>0 


Then 


X = 


Pl+JL 

rz + s 


— ax + b _ a (P z + g) + + jO 

^ * z ~~ cx + d c(pz + q) + d(rz + s ) 

__ (q/7 + 6r)z + aq + bs 
(cp + dr)z + cq + ds 


_ lap + br aq + bs\ 


mz 


\cp + dr cq + ds) 

= (A.B)mz. 

Where A . B is a ‘normal’ matrix multiplication. Hence 
A • (B • z) = (A . 5) • z as x = Bmz. 
9.4 For example, let 


/ = 


V13 + 3 


then 


t = 3 + 


1 


=> t = 


t 

3t + 1 


i.e. 


-u- 


c» * 



where 

Then 
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-G 3 


and so on. In fact 
Now 


t = Cmt 

= C^{c^i) = c 2 *r 

= C 2 *(C*0 = C 3 mt 
t = C n *t. 


c 

c 3 

C 5 = 


p 1 
\1 0 

)• 

/ 33 

\io 


/360 

109\ 

\109 

33/ 

■C! 

3\ , 3 ^ ^ 

/33 

\io 

10\ , 33 . . . 

3 )* "10 < < 


I 

n 

hi 


and so on. 

From II 

10 
3 

From III 

10 

3 

109 33 

=> - < / < - 

33 10 

360 109 , 

=> i09 <f< 33 andso on - 

9.5 Our next problem is to evaluate C n . To do this we must find the 
eigenvector and corresponding eigenvalues of C* 

We have to solve 


(> XH(;) 


i.e. 


3x + y = kx 
x = ky 

o (Jc — 3)x = y 
x = ky 


to find k 


♦ cf. Matrices 2, p. 46ff. 
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These equations are consistent if and only if 


k 3 = Iofc 2 — 3& — 1 = 0 


1 


*i- 
ki = 


k 

3 + vT3 
2 

3 - Vl3 


Taking 


we have 




2 

. 3 + Vl3 


Vl3 - 3 


* = J 


or 


Vl3 - 3)x = 2j 
so that a corresponding eigenvector would be 

(vB - 3) 

Similarly for 


= 


3 - VU 


we have (-Vl3 — 3)x = 2 y and the corresponding eigenvector 
would be 


(-Vil - 3) 

We now use CU = UD, where 


= (l 0 )’ U ~{vU-3 -VB-s) 


and 


Then 


D = 


C = UDU' 1 
C 2 = U. DU- 1 . UDU- 1 
= UDIDU- 1 
= UDW- 1 
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and so on, so that 
C n = UD n U ~ 1 



_1_ /-Vl3 - 3 - 2\ 

4Vl3 \3 - V13 2 1 

i.e. 


? n_ 1 / 

4VI3 V 


_2 

—V13 




—Vl3 - 3 -2\ 


where 


3 + Vl3 


3 - Vl3 


This can be simplified further to give 


( 2 

2 \ 

1-2 r* 

—2t n \ 

\-2u 

-2 1 ) 

[ 2u n+1 

2u n / 


1 /— t n+1 + u n+1 —t n + u n \ 

-\/l3 \+«t n+1 — u n+1 t ut n — tu n ) 

/t n + 1 — u n+1 t n -u n \ 
n-1 _ t n-1 )/ 


_ 1 lt n+1 - u n 

Vl3 \ut(u n — t 


Vl3 \ut(u n — t n ) ut(u 

and t — u = Vl3, ut = —1. Show that 

|C*| = ±1 

and that 


(tn+1 — u n+l)( t n-l _ w «-l) _ (/« — w «)2 = J3(—1)« 

9.6 This method can be used as an approximation process for finding 
the square root of a number* since for example consider 


-CD 


t +1 
f 2 = 3 


* See also p. 128. 
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This idea is to guess a value for V3, 2 say, which we write as 


and then find 


( 


1 3 
1 1 


and to use or 5 /3 as a second approximation. ^ & 1.66. 
The process is then continued 

(! DGHD-t 4 — 


and so on, i.e. 


(i 

/I 3\/x b -i\ _ !x n \ 

\1 1/U-l) w 


1.7273. 


or 


where 


O' 


o-c :ro 

is the first guess. 


A further application: bilinear transformations 

9.7 Bilinear transformations are transformations of the type 

az + b 

w> = —T^’ 
cz + a 

where w and z are complex numbers and a , b 9 c , d are such that 

ad — be ^ 0 

Transformations of this type are extremely valuable in electricity 
theory in the same way as affine transformations.* This type of trans¬ 
formation is the simplest which ensures a 1-1 correspondence of the 
points in the w plane with points in the z plane. 

Consider for example, the transformation 


w = 


2 z - 1 
z + 2 


where w = u + jv, and z = x + jy. 


* See 9.34. 
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Here are some facts about the transformation 






Points on the line y = x transform to give three non-collinear 
points. Is this true in general? 




Investigate this transformation further. 

Answer the following question . 

A transformation from x to y is given by the relation 

_ ax + b 
y cx + d 


all numbers being real. It is written briefly in the form y = T(x). The 
‘inverse operation’, expressing x in terms of j, is written x =T~ 1 {y). 
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Prove that T~ x is of the same form as T and find the corresponding 
numbers a', b' 9 c' and d'. 

Prove also that, if 7i, given by a± 9 bi 9 ci, di, and T 2i given by a 2 , b 2 , 
c 2 , d 2 , are two such transformations, then Ti{T 2 (x)j is also of the same 
type. 

Find the conditions satisfied by a , b 9 c 9 d if 

T{T(x)} = x 9 

identically for all values of x . (CS) 

9.8 Remember that these transformations are, in general 


What type of transformations correspond to Q ? Show that the 

_ * r l a b\ . 1 l+d — b\ 

inverse transformation, i.e. of 1S j- ( . I- 

\c d] ad-bc\-c +a) 

Further information on bilinear transformations can be found in 
books on the complex variable, e.g. Functions of a Complex Variable 
by Phillips; and in books on electro-magnetism, e.g. Electro-magnetic 
Theory by Ferraro. 

These ideas also occur in sequences which satisfy recurrence rela¬ 
tions, e.g. 

au n + b 

U n +1 -; ~i 

CU n + d 


Is it possible to apply our ideas in this case? 


II RECURRENCE RELATIONS* 

9.9 We have seen that a recurrence relation of the type 

au r + b 

U r +1 -"J ~j 

CUr + a 

can be written 

leading to u n = A n m wo, where 



♦ We are indebted to Dr. T. J. Fletcher for the ideas used in this section. 
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Once again we can find A n using the ideas of eigenvalues and eigen¬ 
vectors, and given wo we can solve the recurrence relation. 

9.10 Occasionally we come across simultaneous difference equations 
—for example when we are dealing with chemical reactions. These 
relations can be worked using traditional methods or by the use of 
matrices. 

Examples 

Hydrogen and oxygen react as follows 

2H 2 + 0 2 -> 2H 2 0 

This is a simplified version of what actually happens. The reaction 
only occurs because of the presence of O, H and OH. There are three 
simultaneous reactions: 


o + h 2 -^oh + h 

OH + H 2 —► H 2 0 + H 
H + 0 2 — OH + O 

A single O starts a chain reaction as follows: 



Suppose that the reactions take place at the same speed and that 
there is an unlimited supply of 0 2 and H 2 molecules. Initially we start 
the reaction with one oxygen atom and find out how many O, OH and 
H there are at each successive stage. Let the numbers of these was at 
stage n be O n , OH« and H» respectively. Then 

Ow+i = H» 

OHtm- 1 — On + Hn 
H»+i := On + OH» 

with the initial conditions 

Oo = 1, OHo = 0 and Ho = 0. 
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We can write these equations in the form 


or 

Hence 
Notice that 


and 

and this leads to 


(O n+ i \ /0 0 1\ /On \ 

OHn+i = 1 0 1 OhJ 

JUrt / \1 1 0/ \H« / 

Zn+l === AZ n * 

Z n = A n Zo- 


O n +l = 20 n -l + 0«—2. 

Hft+i = 2Hft-i + Hft-2 
OHft+i = 20Hft-i + OHft-2 

(^ 3 -2A- I)Z* = O . 


This is true for any Zo and hence 

A 3 - 2A - 1= O. 


9.11 This is the Cayley-Hamilton theorem—every matrix satisfies 
its own characteristic equation. We can obtain the characteristic 
equation as follows: 


-l 0 1 

1 —A 1 
1 1 —A 


= 0 or A 3 — 2A — 1 = 0 


The problem is now the usual one of finding A n . We do this in the 
usual way, finding eigenvectors and eigenvalues to use: 

A = UDU~\ A n = UD n U~ 1 


where D is a diagonal matrix of eigenvalues and U is the matrix with 
corresponding eigenvectors as columns. In this case D is 


( 


-i 

0 

0 


0 

-1 + V5 
2 

0 



9.12 A simple example of this kind occurs in the reaction between 
hydrogen and chlorine, 

H a + Cl 2 -* 2HC1. 

This is made up of two simultaneous reactions 

H- + Cl 2 —► HC1 + Cl- 
Cl- + Ha^HCl + H- 
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and can be illustrated as: 

HCl 

(CU 
H 


< 

Cl- 


HCl 


H 2 ^ 

H- 



Stage 0 Stage 1 Stage 2 

This leads to the equations 

H n = Cl«—l 
Cl n — H»-1 
or 

/ H n\ 

\ClJ \1 0) \C\n-l) 

The eigenvalues are +1, —1 and hence it is easy to find 

/0 l\ n 

(l o) f 

in fact it is either 

e 5 - C 9 


depending whether n is even or odd. 

Knowing the initial conditions Ho = 1, Clo = 0 we can now solve 
the problem entirely. 

9.13 Solve the two following chain reactions. 

1. 2O3 -> 302 


Chain initiating reaction 

O3 O2 + O - 

Chain mechanism 



Stage 0 
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2. CH 4 + Cl 2 — CH 3 CI + HG1 


Chain initiating reaction 


Heat or light _ 

Cl 2 - 2 C 1 - 


Chain mechanism 


Cl* 

1 

1 

HC1 1 

J 

[ civ ! 

i 

\ Cl* 

i 

) HC1 

| ch. 

< 

1 

' ch 3 ci j 

} ( 

i ( 

Stage 01 

■ Stage 1 | 

Stage 2 | 

) 

| Stage 3 | 



Stage 4 


Note that these chain reactions are similar in some ways to those 
which occur in nuclear fission. In the fission of uranium, neutrons are 
the propagating particles. However, elementary examples are not easy 
to find. 

Ill DIFFERENCE EQUATIONS 

9,14 In more advanced algebra, it is frequently necessary to solve 
‘difference’ equations of the type 

Un+l — 4un + 3un~! = 0 


given that uo = 2 , u\ = 3. 

At first glance it is not obvious how equations of this sort may be 
solved using matrices, but in fact by using a simple substitution the 
method becomes clear. Put 


This gives 


so 


Similarly 


%+l — Un 

Un+ 1 = 4 U„ — 3 V n 
Vn +1 “ Un 

(u n +l\ _ /4 — 3\/w„\ 

\Pn+l/ \1 0 / \y«/ 

Un\ /4 -3 


Un -1 
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giving us 


CK DO 
CD-C D’C) 


-CD* © 

9.15 We are then faced with our now familiar problem—that of finding 

C D* 

The eigenvalues are 3 and 1 so using the method shown in 5.11, we have 

(r y«(j w.") 


3 - 3 * . 3 * -1 

=>a = _—, b = —— 


C D*- 

This gives us 


3 - 3» /I 0\ 3* - 1 /4 —3\ 

2 \0 l) + 2 [l Of 


i 

f 3»+i - 1 

3 - 3» +1 \ 

( 4 ~ 3 Y = 

2 

2 1 

li 0 ) | 

3» - 1 

V 2 

3-3“ 1 
2 / 


and hence 


/3»+i - l\ , „ /3 - 3” +1 \ 
2 — ) +2 (——) 


_ 3” +1 3 

2 ' 2 


9.16 This method can, of course, be extended to difference equations 
of higher order, e.g. 

Un+1 — 2u n — 5w»-i + 6 w »-2 = 0. 

This time the substitutions will be 


and 


Vn +1 — u n 
W n +1 — V n . 
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Our equations now become 

Un+i = 2 u n + 5v n — 6w n 

Vn+l = U n 
= V n 

or 


Exercise 9,1 

Solve the following difference equations using the above method. 

1. Un+I — 5 Un + 6un -1 = 0 with Wo = 1, Wi = 2. 

2. W»+1 — 4u w -i = 0 with Wo = 1, Wx = 2. 

3. Un+i — 3 Un + 2u n ~i = 0 with wo = 2, WI = 3. 

4. u n +i — u n — 12w w _i = 0 with wo = 3, wi = 5. 

Consider carefully how you would solve 

5. U n +l — 4 u n + 4 Un-1 = 0 ? 

6. w w +i — u n + 2u n ~i = 0 with Wo = 2, WI = 4. 

9.17 Note. Question 5 points to a major difficulty* — that of trying 
to find the wth power of a matrix with equal eigenvalues. These arise 
in cases where the matrix is of the form 



where a is a real number. 

To find the nth power of this matrix 

/2a -a 2 \/2a -a 2 \ /4a 2 - a 2 -2a*\ /3a 2 -2a*\ 

(l 0 Ml oj"l 2a -«sj-( 2 a _.*) “" *' 

Similarly 

i.q /4a 3 —3« 4 \ 

MS =W —2a 3 / 

M4 /5a 4 -4«M 

\4a 3 —3a 4 / 

and in general 

M „ = /(« + !)«” -«a»+i \ 

\ n<x. n ~ l — (w — l)a*y 

* But see the theorem on p. 125. 


'2 5 -6\/u n \ 

1 0 0 11 V n I 
f) 1 0 / \w n ) 


/Un+l\ 
j V n +1 I = 
\Wn+lf 
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In Question 5, a = 2. Suppose m = A, ho = B, then 

u n = (n+ 1)2"A - n2» +1 B 
= 2”[An + A - 2Bn] 

= 2»[Cn + D] 

where A — 2B = C, D = A. 

This is the usual form of the general solution in such cases. 


IV SOLUTION OF DIFFERENTIAL EQUATIONS 

9.18 Consider for example, the equation 

^_ 5 ^ + 6 * = 0 . 

dt* dt ^ 

The first problem is to put this in matrix form. This is done by 
substituting 

dx 


dt 


= y- 


Then 


dy 

-f = 5v — 
dt 


5 y — 6x 


so 



which we write as 


i ;>» 

Consider the particular case where 

sGHJ J)G) 

and suppose that when t = 0, 


dx a 

*““■ s~ f - 


Now 


so* 
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hence 


(;)-(;)*!(-: ;»(;)* 

where <j> is a variable of integration. 

We now substitute for i.e. 

(;khu !)((;)*f(-: sew* 

and so on—substituting again and again ... for 

c )=[-/:u sh+rtt mb 

+ - ■•](?)■ 

Now in this case our integration is straightforward since the elements 


of 


are all constants. Hence 


JKU 5) 1 


so that 


eH-""-© 


Now by Cayley-Hamilton theorem 

J) 

and as the eigenvalues are given by 


A = 3, A = 2 since 


-A 1 
-6 5 — A 


= 0 


we have 
and 


= b 0 + 3*1 
(fit -ss bo + 2*i 
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Giving ns the solution 


dx I = 


3e 2t - le 31 & - e 3t \ /<x 


—6e 3 * + 6e 2< 3eP* — 2e 2 * / 


9.19 The importance of using matrices to solve differential equations 
is that in actual problems you usually have n equations to solve, or at 
least an equation of the type 

dnX . d n -lX . dn—2% « i - Y _ n 

dt n dt n ~ l dt n ~ 2 

By substitution, as above, this can be reduced to a set of equations 
such as 

lir\ /"“ ““•■■"AM 

I ^ \ / U21 U22 • • • U2n \ I *2 \ 


U21 U22 • • • U2n 1 I *2 


W»1 W«2 / \*n 


= Af . [x]. 

A 

In general, the elements of M are functions of t 9 i.e. 


= M(t) . [x]. 


Suppose that when t — 0, 


[x] = [x«] 


/ IF ** = J dt 

[x] = [xo] + £ M(<j>)[x] d<f> 


where ^ is a variable of integration. 


9.20 Let us call 


-I-* 
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i.e. N is an integral operator. Now 

[x] = [Xo] + JV(M)[x], 

We can substitute for [x] in this equation to give 
[x] = [x»] + jV(M)[xo + WM[x]]. 

By continued substitution we have 

[x] = (/ + JV(M) + N(M)N(M) + . . ,)[x 0 ] 

where 

N(M) = f (M) d<j>, N(M)N(M) = P ( M ) f (M) d<f> d<f>. 

Jo Jo Jo 

If the series 

/ + N(M) + N(M)N(M ) + . . . 

is convergent let us call the sum of this series a new matrix P(M), i.e. 

P(M ) = I + N(M) + N(M)N(M) + . . . 

So [x] = P(M)[x 0 ]. 

If the elements of M are constants, we can integrate to give 


= e (M)t . 


e (M)t = ao f a\M + a%M 2 + . . . 

where do, di, a% . . . are constant. 

If M is a 2 x 2 matrix, by the Cayley-Hamilton theorem we have 

e< M)t = 6 0 / + biM. 

In the example, the eigenvalues are 3 and 2, so by substitution 
(** = b 0 + 3 W* 

and e 2t = bo + 2b i] 


* e Mt = I + Mt + + . . . Now M = UDU* 1 where Z> = and u 

can be found. 

=> By substitution 

e m = V [/ + Dt + + . ..] U- 1 = U (q 3 ‘ ® 2( ) £/-i = UboU- 1 + biUDU- 1 

r y-c 3 +-g 5)- 


hence 
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Hence 

b 0 = 3e 2t - 

bi = &t _ e 2t 

=► e<^ = 6o/+fei(M) 


and 

3e 2 * — 2^ e 3 * 

-6*^ + 6e 2 * 8^ 

which gives x — Ae 2t + Be 2t as usual 




Note on differentiation and integration of matrices 
9.21 Consider the matrix 



then the differential of L(x) will be, by definition, 


Lim 

dx -+0 


L(x + <3x) — L(x) 
6x 


( f(x + dx) -f(x) g(x + dx) - gpc) 

_ T . I dx dx 

~ to™ 1 h(x + dx) - h(x) i(x + dx) - i(x) 

\ dx dx 

= {^^c) ^'(x))’ P rovided /(*)> S( x )> h (x), K*) are differ¬ 
entiable functions of x. 

The differentiation of a product can likewise be carried out to give 
4 U(x)L 2 (x) = Li(x) 4 L 2 (x) + 4 • W. 



Check this by taking as an example 

/3 2x\/cosx 0\ 

\x* x 2 ) \sin x x) 

and differentiating. 

Integration (seen as the opposite process to differentiation) will give 

/ ! x /(t)dt r g (t)dt 

L(x) dx = I J L 

yj h(t) dt J i(t)dt 



where t is a variable of integration. 
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9.22 The problems* which occur are those when the eigenvalues of M 
are not distinct, e.g. in the case such as 



The theory discussed is valid but it is impossible from the method 
given to find e Mt (= bol + biM) since we only manage to get one 
equation twice. This means that we shall have to use another method 
for finding e Mt . 

Suppose we are dealing with 2x2 matrices and the equal eigenvalues 
are 2, then if our matrix is M ^ in this casej, we have 

M n = nl n ~ x M — (n — 1 )X n I 


Now e M = I+M + —+ — + — + . . . 

= /+Af+ 1 [2AM - A 2 /] + 1 [3A 2 M - 2A 3 /] 

+ i [4A 3 M - 3 A 4 /] + . 

. fi _ £ _ 2A 3 _ 3A 4 _ 1 

1/ 2! 3! 4! ' ' 'J 

+ M^l+A + i + i + ...J 

= I\e\\ - A)] + Me\ 

|Vhe first series has a general term of 

-in - i)A" = _ r i _ n 
n ! |_(« - 1)! nlj 

This gives the series as 1 — [Xe x — (e x — 1)]J. 


A* 


* But see the theorem on p. 125. 
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Hence if f is a constant 

e Mt = /[<>«(i - AO] + Me xt . 

We have now solved the problem if we know the initial conditions, 
dx 

since if at t = 0, x = A, — = B. Then 

0—“(b) 

and if 

2 !)• A - 2 

= te 2 \\ - It) e 2t \ /A\ 

\ 5e*-2t.e u ) ' \B/ . 

This leads us, as usual to the general solution in the form of 

x = e 2l (Ct + D). 


Exercise 9.2 

Use matrices to solve the following differential equations, with 
dx 

initially at t = 0, x = A, — = B. 


1 < h*. — 3— 4. 2 x = 0 
dt 2 5 dt +ZX U> 

2. ^ - 6 + 9* = 0. 

dt 2 dt 


3 .^ + 4 *- 0 . 


9.23 We have solved the equations, so far, to give us the complementary 
function , i.e. the solution of an equation where the R.H.S. is zero. 
We now turn to the problem of the Particular Integral 
Suppose our equations are given in the form 

D[x] - M[x] + [z] 

where D means and M is a matrix [x] and [z] are vectors. We have 
dt 

solved the equation 


D[x] - M[x] 



208 Matrices: Pure and Applied 

where at time t — 0, [x] = [x 0 ], to give 

[x] = e**[xo]. 

Consider now 

D[x] - M[x] = [z] 

where z is a function of t. Now 

D(e -M< [x]) = + e ~ Mt . D[x] 

= e-^«[D[x] - Af[x]] 

= e~ Mt [z]. 

Integrating we have 






e~ Mt [z] dt 


where this vanishes at time to, ie. 

[x] = e Mt f e~ Mt [z] dt. 
Jto 

Our complete solution is then 

[x] = e m [[xo] + Jf 

9.24 Example. To solve the equation 

d 2 X . dx , , -. 

—— 5 6 x = 3t 
dt 2 dt 


i.e. 


i (;)=(_“ j)C) + (y wi " re 

We have already found the complementary function,* so we only concern 
ourselves with the particular integral. 

This is given by 

Our basic problem then is to find 

£'*•(:>■ 


* See 9.18. 
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This is integrated by parts remembering that when t = to the integral 
is zero. 



=m 

Hence the complete solution is 

(x\ = / 3e 2t - 2e 2t e 3 * - e 2t \ /x 0 \ , l\t + A\ 

\j/ \—6<? 3( + 6e‘ u — 2e 2t J \j’o/ \ i ) 

i.e. 

x = Ae 2t + Be 3 * + \t + A 
which is the same as we should normally get. 

The final problem is to solve an equation such as 

f^_5^ + 6x = e 5 * 
dt 2 d/ 

where the right-hand side is of the form of the complementary function 
and is dealt with in 9,28. 

Exercise 9.3 

Find the particular integral in the following cases: 

1. **_S^ + 6x - 2t 2 . 

dt 2 dt 

A d 2 ,x . dx . f . 

2. -=r- — 5 — + 6x = cos t. 

dt* dt 

9.25 We include a different method for solving the equations when 
they have equal eigenvalues in the matrix. This uses the Jordan 

CANONICAL FORM. 
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This means that any matrix with equal eigenvalues A can be written 
A = B~ l AB 


as 

where 


and 


-co 


is to be determined. 

Since A has equal eigenvalues 


(a - X)(d - A) - be = 0 
A 2 — (a + d)X + ad — be = 0 


has equal roots, i.e. 
Now 


(a + d ) 2 = 4(a<i — 6c) 


/a MW A -f\n l\(e A 
\c A \—£ e/\0 x)\g h) 

wljere A = eh —fg 


X + h -Z — 
z + A A 

A A, 


i.e. 


‘ ,_A + r 


(■) 


These four values can easily be shown to satisfy (i). 

9.26 Example . Find the matrix B (this is by no means unique as 
we shall see) so that: 

u :k(4 -% x a 
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i.e. where 



Our equations become 


2 






A 



Effectively we have only three equations 

hg — —A 
A 2 = A 
g 2 = A 

We can take A = 1, then h = — 1, g = 1 are possibilities, since 
A = 1, 1 = eh —fg 9 so that e + / = —1, and we could consider 
e = 4, f = —5, for instance. 

Our solution is 


u 3-c: x ;k: =!) 

This is only one of many possibilities. 

9.27 Consider 

j t [x] = A[x] 

where A is a matrix with equal eigenvalues, A say, then 
[x] = e^[xo] 

= £/ + At + -- + . . .J [xq] where A — B~ l A B 


■K3 


B + B- 1 At B + B - 1 B + 




[x»] 




= B- 1 


1 + Xt + fr + 
0 




1 + + IT + ‘' 


5[x«] 
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and since B _1 and B are constants depending on our choice 


i.e. 


w-G* '3(b) 

x = e**(A + BO 

— = Be^ 
dt 


as usual. 

Exercise 9.4 

Use this method to solve 

.g d%x m dx . . _/\ 

1. -nr — 4 — + 4x = 0. 

dt 2 dt 

^^90. 
dt 2 dt 

Problems of the particular integral 

9.28 Consider, for example, the equation 


3 M “(-6 J)w+G»)- 


We have already found the complementary function* and so we only 
concern ourselves with the particular integral, i.e. 


[x] = e Mt e~ m dt where M = ^ ® * j. 

n theore 

-U 


Now using Cayley-Hamilton theorem as before 

3e 2t _ 2e 3t e 3 * — e 2t \ 
-2^/ 


and 


so 


e -M* /0\ /l-e‘\ 

Ve 3 */ \3 - 2e f ) 

(‘e-m 

lo W U-2^ 


-6^ + 6e 2t 3^ 

3^-2« — 2^^ e 
6e~ 3 $ + 6e~ 2 * 3e‘ 


zt _ £-2* \ 

-3* _ 2<r 2 7 


and the particular integral is 

3e 2t - 2^ & - e 2t 

-6e* * + 6e 2 * 3^^ 


(J 


-> W t-e* \ = /(I - Os 3 *'! 

- 2e 2t ) \3t - 2e ( J \(3f - 2)^/' 


* See 9.18. 
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Hence the complete solution will lead to (as usual), 
x = Ae 2t + (Br + 

9.29 Consider now an example in which the matrix has equal eigen¬ 
values and the complementary function is of the form required for the 
particular integral, e.g. 

■>+£) 

[x] = e Mt J e~ Mt dt where M = ® 

The equal eigenvalues are given by X = 3. Now 

Mt t , M*t 2 M 3 t d , 

e-ut = / — Mt + —-— + . . . 

We could use the Jordan Canonical Form to help us here. 

Suppose M = B~ X AB, where B is to be found as previously, 

u a-c-*,'..%) ■ —■•*-(; a 

Hence hg — —3) 

h 2 = 11 possible solutions being h = 1, g = —3, 

S 2 = 9j 

also eh — fg = 1 => e + 3/ =» 1 
and we can take c = —2,/ = 1. Now 


[x] = e Mt f e-** dt 

5 KH' 

- - (f '?) £ (0 


— te~ 2t 
e 

—te~ 3t \ 1—2 1 


-e mcr)* 


any 

5 t;»« 


dt 

dt 



This gives us, as we expect x = At 2 *? 1 . 
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Exercise 9.5 
Use matrices to solve 

'■w- 4 ^ + ix - 2e “- + 

2 ^- 2 § + X - e ‘- 4.§i + 4, = 3si„ 2 , 

5. Investigate the solutions of third order differential equations 
using matrices. [Hint the Jordan-Canonical Form gives something like 

(hi 0 \ 

0 hi) 

\0 0 h) 

for two equal roots.] What do you think it would be for 3 equal 
eigenvalues ? 

The above methods are applicable to all differential equations of the 
form x + k 2 x + k^x — 0. Equations of this form occur frequently 
in mechanical or electrical systems. It is in solving simultaneous equations 
of this form (e.g. in coupled systems) that the power of the method is 
apparent. 

9.30 Example. Consider two elastic strings as shown, separating two 
masses, m\ and m 2 . 

Equilibrium position: 



Accelerations relative to A: x,. x 2 


If the strings have stiffness factors k\ and k 2 , then by Hooke’s Law 
T± = kixi , F 2 = k 2 (x 2 — xi) (1) 

Using Newton’s second law (.P = mf) for each mass 


mixi = J2 — T\ 
m 2 x 2 = —T 2 


(2) 
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Using (1) 


Hence 


I mix i = kz(x2 — * 1 ) — kixi 
\m2X2 = —k%(x 2 — *i) 



(ki + fo) v , kz ^ 

mi mi 

, kz kz 

H- Xl — - X2 

m2 m2 


Put wi = xi and W 2 = x 2 . Then 


wi = 


^2 = 
which can be written as 


-(ki + k2) xi+ kz x2 


mi 

mi 

= Wl 

kz 

kz v 

= - Xl 


m 2 

mz 


W 2 



(3) 

(4) 


(5) 


( 6 ) 


We now find the eigenvalues, Ai, A 2 , A 3 , fa of this matrix, and compute 
the corresponding eigenvectors. 

Let the matrix A be the matrix whose column vectors are these 
eigenvectors. 

Then the solution to our differential equations can be written 


( P^\ 

Qe* 2t 1 

R e** I 
S e**J 

where P, Q, R, S are constants. 

The solutions are of the form 

xi = pie Xlt + qie* 2 * + ne** 1 + sie* 4 * 

X 2 = pze* 1 * + qze* 21 + + s&W 

Let us suppose mi =?= m 2 *= 1, ki =s 1 and kz = 2. 



(7) 
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( W1 \ 

I 

1*1 

1- 

1 VV 2 1 

f 

\X2/ 

\ 


Equation (6) can then be written 

/0 -3 0 
0 0 
2 0 
0 1 

The eigenvalues of the matrix are then given by 

— A —3 0 2 

1 —A 0 0 

0 2 -A -2 

0 0 1 -A 



= 0 


A 4 + 5A 2 + 2 = 0 


A 2 = 


-5 ± V25 - 8 


Hence the eigenvectors are 

■ *1 =y\ 


5 + V17 


A s =/' 

Let us call these 

where 


5 - V 17 


h = ™ 


A 4 = -j 


-5 ± Vl7 


5 + Vl7 


5 - V17 


Ai = mj, A2 = —mj 
A3 = nj, A 4 = — «/ 


m = 


/5 + V17 


« = 


V17 


Then the required solution is 

xi = piei mt + qie~3 mt + net”* + sie~i nt 
X 2 = p-2^ mt + qie~1 mt + r-ie 1 " 1 + S2e~^ nt 

which we usually interpret as 

xi = Ai cos mt + Bi sin mt + Ci cos nt + Di sin nt 
X 2 = A2 cos mt + B 2 sin mt + C 2 cos nt + D 2 sin nt 

where Ai, Bi, Ci, Di, etc., are arbitrary constants. 

Exercise 9.6 

1. Solve the preceding problem when 

m\ = m% = 1 , *1 = 2, fc 2 = l 
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2. In the worked example show that, if each body is damped by a 
force equal to c times its velocity, then the differential equations 
describing its motion can be written as 



3. The displacement x of a simple harmonic oscillator satisfies the 
differential equation 


d*x 
dt 2 


+ a) 2 x = 0 


Denoting by M and M' the function x = a cos co(t — a) and 
x = a' cos co(t — a'), show that the functions AM and M + M', 
defined by 

x = Xa cos oo(t — a) 

and x — a cos a)(t — a) + a' cos co(t — a') 

respectively, satisfy the differential equation. Prove that the solutions 
of the equation form a vector space. What is its dimension ? 

The scalar product M . M f is defined as aa! cos to( a — a'). Prove that 

(M + M'). AT = M . M" + M '. M” 

Writing M = m\ei + where e\ and denote the functions 

x — cos cot and x = sin cot respectively, express M . M' in terms of the 
components (mu m2) and (mi, m%). 

If the time t is measured from the instant t = 0 , so that t = t — 6, 
and M is written as m\e\ + ^ 2 ^ 2 , where e\ and H are the functions 
x = cos cot and x = sin cot respectively, find the relation between the 
components (mi, m2) and (mi, m 2 ). What is the nature of the matrix 
involved? (CS) 


V FOUR-TERMINAL NETWORKS 

9.31 A four-terminal network is an electrical system (or black box) 
with two input and two output terminals, i.e. 


M 



V2 
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Basically all that is required is a knowledge of Ohm’s law and common 
sense, to arrive at the various sets of equations which are satisfied by 
the voltages and the currents, for example: 


I, 5 Q I 2 


ii 

h 


v 2 

v * j 

20 

r 

0-0 


0- 1 

[-0 


Vi = 5/2 + V 2 (Ohm’s law) V± = V% 

/i = / 2 V 2 = 2(/i - I 2 ) 

or h = \ V% + h 

or in matrix notation 

© - G DC?) G?)-UC?) 

In general for a resistance RQ. (as on the left above) we have 


ra - c no 

and for a resistance as on the right above of SQ, 



9.32 We can now apply our ideas to a number of circuits joined up 
together as, for example: 

I, I3 h h 

O——MVWWW-►-O-O--WWWW-►-O 



RCi 


rQ 

V, 

V 3 


V 3 


O-o-0-o 


We now have, for the left hand part 

(ftK X) 

and for the right hand circuit 

(« 3 © 



Further Situations in which Matrices are Used 


219 


Hence 

KK DC 3(3 

-c •;%:) 

and the rule for adding resistances in series is established. Similarly 
we can show that for two resistances in parallel the total resistance is 
given by 

l-i + i 

R n /*2 

Prove this for yourself. 

We can also combine the two circuits as 
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9.32 Do we obtain the same results if we combine the circuits in the 
reverse order ? i.e. 



In general, if we have a number of circuits combined and the im¬ 
pedance* of each circuit is Zi, Z2, Z3 . . i.e. 



Inductive and capacitive circuits 

9.33 In the following circuits, w is the frequency of the applied voltage. 
INDUCTIVE 


I, L I 2 

O---' OQOOOObOOO >---o 



* The impedance of a circuit is its resistive effect. 
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CAPACITIVE 



Exercise 9.7 

1. Investigate the properties of matrices of the type 



Notice that the determinant of such matrices is +1. Do such matrices, 
under multiplication, form groups. What types of transformations of 
the plane correspond to these matrices? 

2. Many circuits involve cascade circuits, i.e. a large number of 
similar units joined up. To find the total resistance of a circuit involving 
n series resistances requires that we can find for example 



How is this done? Can you use a similar method to find 



3. A transmission line, having a series impedance of Z per unit 
length and a shunt admittance |i.e. impe ^ ~ n ~) of Y P er unit len S th ma y 
be considered as the limit as dx —> 0 of a cascade connection of a 
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number of networks, each corresponding to a length dx of the line, 
i.e. one unit would be 



The corresponding matrix is 

«-(■ 41 < 


■t D(i 5t 2 ?) 

/1 + zy^ Z(5x + zay 
1 2 4 


i + 2y: 


If / is the length of the line «(5x = /, so that 

/ zy / 2 z/ z 2 y / 3 

lir / 1 + “2^ n + 4« 3 


The problem is then to find M n , again. There is a ‘Binomial Theorem’ 
for matrices (due to Proctor Wilson), i.e. 

Ichy + kshy Zo(l — k 2 )*shy\ n 

( 2 (1 “ k 2 )*shy chy — kshy J 

( chny + kshny Zo(l — k 2 )hhny\ 
~ (1 — k 2 )hhny chny — kshny J 

Prove it by induction. 

4. The y matrix is the inverse of the impedance matrix of a circuit 
Z, where 

(£K;i ::)© « — 

so 

®-(E °' ,= rr 
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Consider 



Here 


But 


©-ms - 


and 


/« = /£ + n 


Prove a similar result using Z matrices for the circuit 



For further information see books by J. W. Head and Mary Tropper 
Matrix Theory for Electrical Engineering Students (Harrap), also 
Matrix Algebra for Electronic Engineers by P. Hlawiczka (Iliffe). 

VI AFFINE TRANSFORMATIONS 

9.34 We have met before the idea of transformations of the plane 
by the application of matrices. We now consider transformations of 
the type 

(;K XK) 

or x —► ax + by + e\ 

y-+cx + dy + f) 

It is easiest, to begin with, to take particular values for a , b, c, d, e and/, 
and to investigate this particular transformation. Consider 

x -~*x —y + 1 
y-~*x + 2y 
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As with the theory of knots,* our problem is to find out what remains 
invariant under such a transformation. 

We shall find the images of two easy lines which are perpendicular, 
i.e. y = x and y + x — 1. 



It is quite clear that straight lines are transformed into straight lines 
but the angle between them has altered. Are parallel lines mapped 
into parallel lines? Try y + x = 1 and y + x = 3. Now experiment 
for yourself, using the above transformation and answer the following 
questions. 

Exercise 9.8 

Investigate whether under the transformation given. 

1. Every point has a unique image-point. 

2. The image of the intersection of two lines is the intersection of the 
two image lines. 

3. The distance between points is not invariant. 

4. The distances between points on the same straight lines are enlarged 
by the same factor. 

5. The midpoint of a line segment is mapped into the midpoint of 
the image line segment. 

6. Circles transform into circles. (Consider points on the circle 

* 2 +J 2 = 25.) 

7. Do tangents transform into tangents? 

8. Three non-collinear points are mapped into three non-collinear 
points. 

9.35 We have considered a particular transformation, i.e. 

0-c ~;)(;K) 

What problems are associated with a transformation such as 

0-G !)(:K)’ 


* See 9.46. 
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In fact we have met these problems before when we were investigating 
the transformations of the plane. To avoid such problems we shall 
only deal with transformations which are 1-1 and so 

ad — be 7 ^ 0 

is a necessary and sufficient condition for this. 

It is interesting to note that it is possible to find a transformation 
under which any three given points can be mapped into three predeter¬ 
mined points. For example, any triangle can be transformed into an 
equilateral triangle. For instance, is it possible to find a transformation 
of the points, (3, 0), (1, 3), (—1, —1) into the points (2, 0), (—2, 0) 
and (0, 2 1 / 3 ) ? Suppose the transformation is 

x —► ax + by + e 
y ->cx + dy + / 

Then the image of (3, 0) is (3a + e, 3c + /), and this must be (2, 0). 
Hence 

3a + e = 2; 3c f = 0. 

Similarly, the image of (1, 3) is (a + 3b + e, c + 3d +f) which must 
be ( —2, 0 ), so that 

a + 3b + e = —2; c + 3d +f= 0. 

Finally the image of (—1, —1) is (—a — b + e, — c — d +/), which is 
(0, +2V3), SO 

—a — b -j- c = Oj — c — d - {- f = 2V3 

We have six equations in six unknowns and so there may be a unique 
solution: In fact 



In general, then, any three non-collinear points may be transformed 
into the three vertices of an equilateral triangle. 


9.36 It is now clear that it is possible to map a parallelogram into a 
given square. 

We are now able to prove many properties of triangles and parallelo¬ 
grams by considering them to be mapped into equilateral triangles and 
squares, proving what we require with these and remembering what 
properties are invariant under the transformation and then mapping 
back to our original figures. 
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This idea is a very important one in electro-statics where we are 
interested in, for example, the field round an elliptical conductor. 
The whole problem is transformed into finding the field round a circular 
conductor, which is much easier, and applying the transformation 
which will take the circle into the ellipse. 

Exercise 9.9 

Prove the following, using the ideas of transformations in general. 

1. The ‘mid-point theorem’ for triangles. 

2. The medians of a triangle intersect and do so at the point of 
trisection of the median, i.e. so that here AG = § AA'; BG = § BB'. 



3. The diagonals of a parallelogram bisect each other. 

4. The midpoints of the sides of a parallelogram form the vertices 
of another parallelogram. 

5. Investigate circle theorems and see if you can find corresponding 
results for the ellipse. 


Group structure 

9.37 The affine transformations form a group under the rule of 
‘followed by’. 


Closure 

Consider the transformation in the matrix form 


X —► AX + B, 



or 
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If we follow this with X -> CX + D. Then 

X AX + B A(CX + D) + B 

—► (AC)X + (AD + B) 

which is another affine transformation. 

Associativity 

This can be demonstrated easily. 


Identity 

The identity transformation is the one such that. 


X 


7X + O 


where /-(J J) 


Inverses 
The inverse of 


X —► AX + B 


will be 
where 


A~ X (AX + B + B -1 ) —► X 



and 


A 1 ad-be (-c 



9.38 It is clear that the transformations of the type X -► AX will 
be a subgroup of our group of affine transformations. Under what 
conditions do the affine transformations give us (a) isometries (i.e. 
transformations in which distance is invariant, ( b ) similarities (i.e. 
transformations in which angles are invariant)? 

In fact for isometries 

X -► AX + B 
where A is orthogonal, i.e. 

A .A' — I 


(A! is the transpose of A). 

For similarities 

X AX + B where A . A' = kl 

Algebraically, for isometries 

& + b* = c 2 + = 1 

ac + bd == 0 


and 
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For similarities 

a 2 + b 2 = c 2 + d 2 = k (k > 0) 
and ac + bd = 0 

Note that 

ad — be ^ 0 

Check that these are true by taking particular values for a , b 9 c , d 9 e 
and / which satisfy the conditions. For further information on affine 
transformations see, for example, A Survey of Modern Algebra by 
Birkhoff and MacLane. 

VII MATRICES AND THE THEORY OF GAMES 

9.39 We shall be considering games of strategy (not of pure chance) 
between two players, each of whose ‘moves’ influence but do not 
completely determine the outcome. Each player must try to make the 
best possible moves, regardless of what his opponent does, and must 
try to ensure that his ‘payoff’ will be as large as possible. 

For example, let us call two players R and C ( R will have command 
of the rows and C will have command of the columns in our matrix). 
R has a silver sixpence, and six copper pennies in a pile (a copper 
sixpence); C has a silver sixpence and a copper penny. 

R has two piles S6 Cu6 
C has two piles S6 Cul 

The game is played by exposing the coins simultaneously; if they match 
in material R wins the positive difference in amount, if they do not 
match C wins the difference in amount. We can then display the winnings 
in the form of a matrix. 


Player C 
S6 Cul 

S 6 

Player R 

Cu6 


The entry —5 means that R pays C fivepence; the entry +5 means that 
C pays R fivepence. The game is completely described by the matrix. 

9.40 Strategies. Player C would obviously like to achieve the entry 
—5 by choosing the second column. He then stands to win (or lose) 
fivepence, depending on K s choice of row. It is obviously sounder to 
choose column 1 since no matter what R does, C will break even. 
R on the other hand, would like to achieve + 5 by choosing the second 
row. 


0 

-5 

0 

+5 
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The optimal strategy then is for player C to play S6 and for player 
R to play Cu6. 

9.41 The value of a game is the outcome if both players use the 
optimal strategy. If the value of the game is zero, then the game is said 
to be fair . 

A game is said to be strictly determined if the matrix contains an 
entry v which is simultaneously the maximum of its column and the 
minimum of its row. v is the value of the game. Strictly determined 
games can be solved easily since each player can calculate his opponents 
best move. 

In our example, the game is a fair, strictly determined game. Not all 
strictly determined games are fair. 

Examples 

1 . 2 . 


Fair; strictly determined. Strictly determined, value 4 



0 

3 

-2 

5 


0 

3 

2 

1 


Not strictly determined. 


2 

3 

4 

-1 


Not strictly determined. 


Exercise 9.10 

Find out which of the following games are strictly determined and 
give the value of those that are. 

1 . 2 . 



6 

0 

0 

3 


2 

-1 

-1 

3 


4 

2 

0 

5 


3. 


4. 
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5. 


6 . 


6 

0 

0 

“1 



9.42 It is quite clear that some 2x2 matrix games are not strictly 
determined, i.e. they have no entry which is simultaneously the maximum 
of the column it is in and also the minimum of the row it is in. 


Example 

Consider a game in which one player hides in his hand either a 
sixpence or a shilling. If the other player guesses the coin then he wins 
it, if not he gets nothing. The games matrix is. 


R chooses 


C guesses C 

6d Is 6 12 


6d 

-6d 

0 

or in pence 6 

-6 

0 

Is 

0 

-Is 

R 

12 

0 

-12 


The game is not strictly determined. How should it be played? If the 
game is played for a number of times it is clear that R must not play 
(or hide) the same coin all the time, for C would soon guess this. 
Nor can R afford to play in any fixed pattern, for once C has realized 
this then, he can always win. 

We now introduce the idea of a mixed strategy— for example, for 
R we say that he is to play row 1 with probability pi and row 2 with 
probability p 2 with pi + p 2 — 1, pi >0 ,p 2 > 0. Similarly for player 
C a mixed strategy will be to play column 1 with probability qi and 
column 2 with probability q 2 , with as before qi + q 2 = 1, qi 5 s 0, 
q 2 > 0. 

A mixed strategy vector for R is (pi, pi) and for C is 



DEFINITION 

9.43 For a non-strictly determined game of value v, with best strategy 
vectors of (pi pi) for R and for C then 


(i) (pi pi) ^ ™ ^ > (» v) 



where is the game matrix. The game is fair if v = 0. 
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If both R and C use their best strategies, equality occurs. 

The expectation of R ’s pay off will be (iii): 

apiqx + bpiq 2 + cp 2 qi + dp 2 q 2 = v 

(since a occurs with probability pxqi and so on). We can now calculate 
the values of p± 9 p 2 , qi and q 2 and also v in terms of a 9 b 9 c and d, since 


from (i) 

pia +pzc = v 
pib + pzd = v 

from (ii) 

aqi + bqz = v 
cqi +dq 2 ~v 

and (iii) 

apiqi + bpiqz + cp 2 qi + dp 2 q 2 = v 


Solving (i) remembering that pi + p 2 = 1, 


and similarly 


and 


d-c t 

pi = ———7 -; p 2 = 1 — pi 

a + d — b — c r 


d-b - 

?i — ——---; q 2 = 1 — qi 

a + d — b — c 


v = 


ad — 6 c 
a + d — 6 — c 


Note that v = 0 when 


a 

c 


b 

d 


= 0 . 


9.44 First of all, let us consider the previous example (p. 230). 
The above results give: 

- 12-0 2 1 

Pl -6-12-0-0 3 =>/ ’ 2 3 

- 12-0 2 1 

qi -6-12-0-0 3 ^ q * 3 

_ (~ 6 ) (- 12 ) ~ 0.0 

- 6 - 12 - 0-0 
+72 
~ -18 

= -4. 
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This means that R should play row 1 with probability f, row 2 with 
probability C should play column 1 with probability f, column 2 
with probability Also the value of the game is given by, 

—6 .§.f + 0.§.J + 0.£.§4 -12 . £ . £ = —4 

as before (this is R’s expectation). 

9.45 Now let us consider the game matrix 



2 


5 


Here 


5-0 

pi ~ 4 + 5-2-0 
5-2 

?1- 4 + S- 2- 0 
4.5 - 2.0 

^ ~ 4 + 5 - 2 - 0 

= 2f. 



It is possible to find the probabilities quickly by the following method. 


Consider again. 


C 


- 

0 

E 

-12 


To find the odds we proceed as follows. Subtract the numbers in the 
second row from the corresponding numbers in the first row, to get 


+12 


We neglect the minus sign and the probabilities for C are as follows. 
Divide by 12 + 6 to give probabilities 


and then swap them round to give 



Probability of Probability of 

playing 1st col. playing 2nd col. 
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We can do likewise for R by subtracting numbers in the second column 
from the corresponding numbers in the first, changing the odds to 
probabilities and interchanging as follows: 


It is easy to show that the formulae and this method are equivalent. 


-6 


+12 


12+6 \ 

— 12 — J 

12+6 / 


Probability of 
playing 1st row 

Probability of 
playing 2nd row 


-6 

0 

0 

-12 


Exercise 9.11 

Find the best strategies for the following games and the value of 
each game. 



4. 


5. 


6 . 


7 

3 

11 

2 


1 

0 

-2 

4 


7. R has two airfields. He has only enough troops to successfully 
defend one at a time. C can attack one or the other, but not both. 
One of the airfields is twice as valuable as the other. What strategies 
should each adopt? 

8. A man is rushing home late and he suddenly remembers that it is 
his wife’s birthday—but he is not sure. The only shop open is a 
tobacconist’s where he can buy chocolates. 

If he comes home with no chocolates and it is not her birthday, 
nothing will be said, i.e. pay-off 0. If he comes home with chocolates 
and it is not her birthday, she will be delighted and offer him a couple 
of drinks (pay-off 2). If it is her birthday and he has remembered the 
pay-off will be 3 say! If he has forgotten and it is her birthday a miserable 
evening lies ahead (pay-off —12). What should he do? 

For further information on Games Theory and Matrices see The 
Compleat Strategyst, J. D. Williams, McGraw-Hill, ‘Teach Yourself 
Operations Research’, and for applications see for example Quantitative 
Geography , Cole and King. 
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VIII KNOTS 


9.46 The elementary theory of knots is an easy example of how 
matrices can be used to describe various situations. Knots can be 
divided into two basic types: 

(a) those which can be regarded as unknotted and are of the same 
type as the circle. 

(b) those which are knotted. 

Knots may be altered by twisting, bending or stretching but not by 
cutting and rejoining. Under these simple transformations we have to 
search for an Invariant (something which does not change by twisting 
etc.). This search for an invariant property is a common one in Mathe¬ 
matics and often it is not an easy property to find. What we require, 
then, is to find a quantity which is unique for each type of knot and 
remains constant under our simple, allowable transformations, e.g. 



cxoo 


These two diagrams represent an unknotted situation, the second being 
a loop twisted over. Here is another unknotted situation: 



What have such knots in common? 


The determinant of a knot 

9.47 (a) Shade the regions in a chessboard fashion, alternately 

black and white, with the outside (infinite) area black, e.g. 
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(b) Give a value to each cross-over point, either +1, — 1 or 0 as shown. 



(This value is given when the two shaded areas are, in fact, the same.) 

(c) We now number the shaded regions (except for the infinite one). 

(d) Form the matrix K with entries as follows: 

k pp = the sum of the values on the area labelled p, 
kpq = kqp — minus the sum of the values common to regions 
p and q . 

(e) The numerical value of the determinant of AT is a knot invariant. 
9.48 For example , consider the knot below: 



The matrix 


and so det K = +5. 


*= (m - 3 ) 
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Note. For a circle, clearly of the same form as the invariant is +1. 



This is a rough and ready method for determining the type of the knot. 
There is a much more powerful method as we shall see. 

Exercise 9.12 

Find the knot invariant in the following cases: 




A more powerful method 

9.49 (a) Orient the knot by following round the knot and putting 

in direction arrows. 

( b ) Label each cross-over point and each piece of the knot (i.e. 
between going under one part to going under another). 
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(c) Give the value +1 and —1 to the pieces at a cross-over point. 

\/ "\/ +1 

+ ,/v, A 

(d) Find the matrix K with rows corresponding to cross-over points 
and columns corresponding to pieces. 

(e) Construct L by deleting last row and column of K. 

( f ) Construct M from L by adding first column to second, second 
to third and so on. 

(g) For any value of n e Z + , the set of positive integers, the numerical 
value of the determinant of the matrix 

M n -(M- I) n 

is another knot invariant. 

9.50 For example , consider the knot 



Show that for 

n = 1 the knot invariant is 1 (always) 
n = 2 the knot invariant is 3 (same as by first method) 
n 3 the knot invariant is 4 
n = 4 the knot invariant is 3 
n = 5 the knot invariant is 1 
n 5= 6 the knot invariant is 0 

n = 7 the knot invariant is 1 and the pattern repeats. 
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The values given for n == 2 and n = 3 enable us to distinguish the 
different types of knot apart from reef knots and granny knots. 

Exercise 9.13 

Choose one of the first three questions in the previous exercise and 
calculate this new knot invariant. 

For further work on knots see articles in the Scientific American, 
and books, e.g. Introduction to Knot theory —Cromwell and Fox, 
Ginn 1963. 

IX MATRICES WITH COMPLEX ELEMENTS* 

9.51 A matrix with real elements can have three main operations 
performed on it: negativizing, transposing and inverting. The first 
of these consists of just multiplying each element of the matrix by — 1: 
it is written — A . The second consists of writing the rows as the columns 
of the transpose matrix, and the columns as the rows: thus an m x n 
matrix becomes n x m when transposed. It is written A\ The third 
operation consists of finding a matrix A* 1 such that AA~ X = J, 
where I is the unit matrix for the set of matrices concerned. 

If a matrix has complex elements, we may add a fourth operation to 
these three. This operation consists of replacing each element of the 
matrix by its complex conjugate, to give a new matrix A. 

9.52 Consider an Argand plane with complex axes u and v. These can 
be transformed by a matrix 

where a, /J, y and b may be complex. Then if the transformed axes are 
u', v ' we have 

u' = aw + fiv 
v' = yu + bv 

We restrict this matrix Q to be a unitary matrix of determinant 1. 
A unitary matrix is a matrix with the special property that A (A') = /, 
or (A')- 1 = A. If a, {3, y and 3 are the complex conjugates of the 
elements of Q , then 

Q(Q') = / 

where Q is the matrix which has the complex conjugate entries corres¬ 
ponding to the entries in Q, i.e. if 



* We are indebted to Andrew Ellis for this section. 
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Now 

/a A /a y\ _ /I 0\ 

\y W *)-\o V 



aa + fifi = 1 

(1) 


ay + f}8 = 0 

(2) 


ay + fid = 0 

(3) 


yy + dd — l 

(4) 

The second and third equations are equivalent, so 



olol + fifi — yy + dd = 1 

(5) 


aa + fifi — 0 

(6) 


Equation (5) is an equation in real numbers, since oca, /S/5, yy, SS are 
all real, Equation (6), ocy + fid = 0, contains complex numbers, and 
so we have two conditions imposed. We have also a<5 — fiy = 1 (since 
A = 1). All told there are five conditions. We usually leave the matrix 


Q as 



although we may express it in terms of 3 independent 


variables. 


9.53 Consider for example a matrix 


= ( f. x ~ jy ), 

\x+jy -z ) 


\x+jy 

where x,y , z may be taken as the coordinates of a point in 3 dimensions, 
or as any 3 real ordered numbers. Note that 


P ' = / * *+;>y = / z = 

\x—jy —z J -t J 


(7) 


A matrix with this property, P — {?'), is called self-adjoint or 
hermitian. The matrix P also has the property that its trace (i.e. the sum 
of the elements along the main diagonal) is zero; in this case 

—z + z = 0 ($) 


Let us transform the matrix P as follows, to give 

Pi = QPQ' = QPQ- 1 since Q = Q- 1 

The two properties (7) and (8) mentioned above are invariant under 
transformations of this type. Hence the new matrix Pi may be written 


Pi 



z i 

XI +jyi 


xi -jyi\ 

-Zl ) 
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Also, as the determinant of Q is 1, the determinant of Qr 1 is also 1, 
and so 

APi = AQ . AP. Ag-i 
APi = AP 

i.e. 

—z 2 — x 2 — y 2 = — z\ — x\ — y\ 
x 2 + y 2 + z 2 = x\ + y\ + z\ = AP = APi. 

If x, y, z are coordinates, then the length of a general vector 

AAA 

h = xi + y] + zk 

is unaltered by any transformation with this property. The transforma¬ 
tion is a rotation of some sort, and can be represented by a 3 x 3 real 
orthogonal matrix, since an orthogonal matrix is defined as one for 
which the transpose equals the inverse, or 

A = (A')- 1 

The unitary 2x2 complex matrix Q corresponds to a real 3x3 
orthogonal matrix. This matrix is found as follows: 

9*54 Let D be a 3 X 3 orthogonal matrix, transforming (x, y, z) 
to (xi, yi, zi). Then h' = Dh. Let the corresponding 2x2 unitary 
matrix be Q±: then 


Pi = QiP(Qi)- 

With another 3x3 orthogonal matrix B, transforming (xi, vi, zi) 
to (x 2 , }' 2 , Z 2 ), the corresponding Q-matrix is Qz- Then 

h" = Eh' 

= BDh 

And Pz = QzPiiQ'z) 

Now let C = BD. There is a matrix Qz corresponding to C. But 

p 2 - Q*Pim = q2Qip(Q[)(q / 2 ) 

Since = ( 6261 )', we have Q3 = Q2Q1 as the matrix cor¬ 

responding to C. In fact, all the relations satisfied by the matrices of 
one set (such as D) are satisfied equally well by the corresponding 
matrices of the other set (such as Q ): the two sets of matrices can be 
shown to be isomorphic. Now 


(g')=2- 1 =(_J Q as AQ = 1 
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and thus 


*)U> X -?Vr "«) 

_ + /9(* +7» «(* — jy) ~ P* 


-y»-M/ <5 —A 
-jy)-5z]\-y «/ 


yz + d(x + y» d(x 

'(«d + Py)z + 0d(x + jy) - cty(x - jy) 

- 2 *Pz + a 2 (x -;» - P\x + jy) 

2 ydz — y 2 (x — jy) + d 2 (x + y» 

- (ad + py)z - pd(x + jy) + a y(x - jy)) 


Thus 


zi = (ad + Py)z + pd(x + jy) - ay(x - jy) 
xi + jyi = 2ydz - y\x -jy) + <3 2 (x + jy) 
xi -jyi = -2a pz + a 2 (x -jy) - P\x + jy) 

For a general 3x3 matrix, with entries a u 

Z\ = 031* + 0327 + 033Z 

giving 

zi = (J38 — ay)x + j(ay + pS)y + (ad + Py)z 

and the 3x3 real orthogonal matrix is 

A(a 2 — y 2 + d 2 — P 2 ) \j(y 2 — a 2 + d 2 — P 2 ) yd — a/J \ 

i;(a 2 + y 2 - p 2 - d 2 ) £(« 2 + 7 2 + A + <5 2 ) ~M + <5y) 

\p5 — a.y j (ay + /3d) ad + /3y / 

This is the £>-matrix corresponding to our g-matrix. 

9.55 The position of a body in 3 dimensions may be specified by 
(x, y, z). By means of the matrix above a transformation may be 
applied. It is equally true that the position of a body in space may be 
specified by the four parameters a, p, y and d, which are called the 
Cayley-Klein parameters. 

The original matrix P which we used in these transformations is 

•c ?)*■(; 4 

These three matrices above are called o'*, a y and a z . They are the Pauli 
spin matrices , and are associated with the spin of an electron in orbit. 

These matrices, together with the unit matrix ^ ^ j, form a set of 

four independent matrices, from which any 2x2 matrix can be built. 
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In the complex space considered, the matrix corresponding to a 
rotation through an angle <f> about the z-axis is 

le^ 2 0 \ 

[ 0 e-M 2 ) 


9.56 If we consider a rotation through the angle 2tt about the z-axis, 
in space of 3 dimensions we shall return to the original set of axes. 
But if in the above matrix 

(j) = 2lT 

then 


/efr 0 \ /-I 0\ 
\0 e-*) \ 0 - 1 / 


This matrix must correspond to the 3 x 3 unit matrix. In fact, there 
are two 2x2 complex unitary matrices isomorphic to any given 
3x3 real orthogonal matrix (one being minus the other). Remember 
that the 2-dimensional complex space constructed here is an entirely 
mathematical concept, and so does not require the same properties 
as 3-dimensional space. 


9.57 A vector in the complex space is called a spinor. Although the 
space is a purely mathematical conception, it is used in the realm of 
quantum mechanics. The wave function of an electromagnetic radiation 
must be made a spinor in order to include the effects of electron spin. 
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Answers 


Exercise 1.1 

1 . (a) 2 x 2 (b) 3x1 

(/) 2 x 3 C§0 1x4 

2. Follow since the addition of 


(c) 1 x 1 (d) 1x2 e) 3 x 2 

(A) 2 x 1 (0 4x3 

real numbers is commutative and associative. 


4. 0. 


(c) Does not exist. 


Exercise 1.2 

1. (a) (32) 

(d) Does not exist 

2. In general no. 

3. n =p,m =q. 

6 . A, B must be square and of the same order: further conditions must also be 
imposed on the elements. 


/4 8 12\ 

(b) 5 10 15 

\6 12 18/ 

(e) Does not exist (f) (J* + 5 J J %) 


7. For example (J J)-.(J J) = (J J) 
Exercise 1.3 

3 {rtj’ th)- Inthis0386 (S S) 


Exercise 1.4 
1 . (a) -2 
(d) a 2 - A 2 
(*)0 

5. (S) 43 

6 . (c) (i) -430 

7. (c) (i) 0 


( 6 ) 0 
(«) -8 
(A) abc 
(b) -43 
(ii) -258 
(ii) -7 


+236 

-23 


8 

(iii) —43 


Exercise 1.5 

1. (i) -74 
(iv) 448 


(iii) 7000 
(vi) 0 


(iii) adf 


(iv) a/30 


(ii) -10,636 
™ (v) 12 

3. 1 and approximately 16*5, 1-5. 

4. 0. 

6 . (b) (i) 24 (ii) 60 

l] U| = 98; M 2 | = (98) 2 ; |5^( = 125MI; 5|^| = 5.98; li^'l = *r\A\; 
\ A *A'\ = (98K 

9. (a) (p - $)(? - r)(r - /?) (6) (p - ?)(? - — P)(P + 0 + r ) 

11 (ii) —(jc — y)(y — z)(* — *)(* + y + *)(* 2 + + z 2 ) 

—22 ^ ^ ^ 13. 3 a > 2 14. A, B, C are colhneai 

” a (ii) (x - y) n_1 {* + (n — l)y} 


16. (i) x * a, 1 , 


1 +a 


243 
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17. (i ) (a+ b + c + d)(a-b + c- d)((a - c ) 2 + (b - d) 2 ) 

(ii) (x -f y + z) 2 (x 2 + y 2 + z 2 — xy — yx — zx ) 2 

18. (i) —Mb — c){c — a)(a — b) 

(ii) (x + a + b + c)(x + a — b — c)(c — b + a — x)(b — c + a — x) 


Exercise L6 



Exercise 2.3 



/ 1 2 3\ /I 2 3\ 

(iv) 14 25 36 (v) 2 1 3 

\ 7 8 9/ \7 8 9/ 


Exercise 2.5 
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Exercise 3.1 

1.x = 1,^ = 2 2. x = 1, y = -1, z = 4 

3 . x = — 2 , >>= 3, z = 4 4. A — 0 


Exercise 3.2 

1. (i) x = —t, >> = 3, z = f (ii) a: = —2, >> = 4, z = 0 

2. (i) x = t, y = — , z = 2/ - 2 (iii) Three identical equations. 

Exercise 3.3 

1. (a) x = 2 , >> = 1 ( 6 ) x = tV, y = -tt 

(c) inconsistent (c/) same equation twice 

(e) X = — 3, y = — 3 (/) X = —27, ^ = 25 

2. Yes. 3. (2,1) 4. * = 1, y = 1 

/i n\ [a b p 

6. ( Q 2 );x = !,, = ! 7. c rf , =0 

Exercise 3.4 

1 . * — 2, >> = 3 2. No solution 

3 . x = —2, y = —4 4. x = f, y = it — | 

5 . x = if, >> = — & 6 . No solution 

7. No solution 
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Exercise 3.5 

1. x = y = f, Z as fi 


6 — 3/ 


3. x = /, 7 = — 2 —» z 1 

5. x = A, y *= y, * == rf 
7. x = 0, 7=0, z = l 

225 24 _ A 

9- *~nr y 39 * z 0 
11. x = — 2 , 7 = 3J, z = 8 i 

Exercise 3.6 
1. (a) No solution 

(c) x = 1,7 = - 1 , z * 2 
to x = t,y 


24-13/ 


2. x = l, 7 = 2 , z = 3 

3-3/ 


, 24 - 2/, 

4. * =-^—- y * 


13 


, z = / 


6. * = 1, 7 = 0, z = —57 
8 . x = 1 , 7 = 10 , z = 1 

10. * = /, 7 = 47 i4 64t > 2 = y - l0t 


(b) Three parallel planes 

. 34-31/ 

to x = t,y =- 5 -, z = 


10-7/ 


— , z = (/) No solution 


C?) * = 3, 7 =* 2, z = 1 
/2 0 0 \ 

2. |0 2 0); x = 113*7 ,7 = —76-8, z = 8M5 

\0 0 2 / 


. 20 + 6 / -8 - 9/ 

(A) * = /, 7 = —, z = 


11 


4. (ii) x = 1 ,7 =* 2, z = 3. 


/ 1 0 1\ 

5. (-14 4 -17); x = 1 , 7 = “ 8 , z = 7; * = 4, 7 = -69, z = 53 

\ 11 —3 13/ 

6 . * = 1,7 = 2, z = —1 

7. x = 10 , 7 = 9, z = 11 (ii) and (iii) No solutions 

8 . a = 3 

/—t A 

9.1 4 -A Al;x = 2,7 - U * = -2 
\ * A -«/ 


10. Xl — 7, X2 — ' 4* X 3 — ^ 

11 . * = i,y = -2 

Exercise 3.7 

1. A = 1, or A » 5 

3. -0-84 

2. x = t, y = —At, 

Exercise 3.8 

1. x « 1 ,7 “ x — 0 

2. x = 2, y = —1, 

Exercise 3.9 

1. x = 2, 7 = 1, z = —2 

K> 

X 

11 

t—i 

00 

£ + 

3. Same equation 3 times 

4. No solution 

Exercise 3.10 

1 . ( 0 , 0 , 0 ), (i, *,!),(¥, -S, 

2 . x = -(a + p + y), 7 = 1 , 

(5, -7, 3) 

z = 1. 3 identical planes 


8 + 19/ 
11 


Z = 


3 . x =^(3 ab - 8 a 2 - 5b + 5ac + 8 - 3c) 

i (2b - 2ac - ab + 3a 2 + c - 3), z = ^ (c - a - X) 
where A = b — a — 1. 
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, , t , — be _ ac _ ab 

4. a * b * c, x - {a _ w(c _ fl) , * - (fl _ /,)(*, - c y 2 " {a - c)(6 - c) 

5. (i) x - i, y = -i, 2 - t (ii) X = U y s 5 3/ " “ 33/ 1 


12 ’ 12 
6(1 - a)(a 2 4- 6 - a - 1) 
{ab - l) 2 


b - 1 6(a 3 6 - a 21 - ab + 1) _ 

6 * * " a(a6 - iy y “ (ab - l) 2 ’ 2 ! 

7. A: = 1 

8. x = j </(26 4- c), y =* (2a 4- c), 2 =» ^ (6 — a) 

where A is the determinant of the coefficients. 

9. 3 abc — a 3 — 6 s — c 3 — O;a4-64-c = O;a4‘04-y ss =O;a = 65=c; 

a = P = y 

10. X = 3kg, Y = Z = 5i kg 


Exercise 3.11 
1 . (a) x+y . 


1 


(«y«i 


( C) f + I= 


l 


2. (a) x 3 + y 2 — 6x + 8y = 20 (6) x 2 + y 2 — 6x — 8y = 0 

(c) Not possible; the given points are collinear. 

(d) x 2 + y 2 - 6x - 8y = 0 

3. (a) 6x + 4y 4* 3z = 12 

(6) Not possible by this method. The given points are collinear, and satisfy the 


relation x — 2 


. y±l 
2 


z — 4 


. There are an infinite number of possible 


planes, but none are determined by this method, 
(c) 3x — 4y + z = 36 

4. (y - A) 2 = 4a(x - k) 


6 . 





y 2 

x y 

1 










yi 2 

ya 2 

xi yi 
X 2 ya 

1 

1 

= 

0; i 

[y +1) 4 

- 

—16x 




ya 2 

xs ya 

1 










X 2 

x y 

1 

i 









Xl 2 

X2 2 

xi yi 
X2 y2 

1 

1 

= 0; 

(y 

-4) 

S3 

(x 4- l) 2 




X8 2 

X8 ya 

1 

i 









X 2 

4-y 2 4 

- 2 S 


X 

y 

2 

1 





Xl 2 

+ yi 2 4 

- 2l 2 

Xl 

yi 

zi 

1 





X2 2 

+ y2 a + Z2 

X2 

ya 

z 2 

1 

■> 0 




X8 2 

+ ys 2 + Z3 2 

X8 

ya 

Z8 

1 





X4 2 

4-y4 2 4 

- Z4 

2 

X4 

y4 

24 

1 


XI 

n 

21 1 










X2 

Xs 

ya 

ys 

Z2 1 
Z2 1 

= 

0, then 

the 

four points are coplanar. 

X4 


24 1 











If 


Note that this does not necessarily mean there is no sphere through the points; 
if the four points lie on a circle in this plane then an infinite number of spheres 
could be found, but none are determined by the above equation. 


+ Z_±_ = i 

9 M6 144 

8- («) 5J5 ~ XP - *. « hyperbola 


24a 

c) x 2 — y 2 = 9, a rectangular hyperbola 

d) (jc + 2yX*_— 2y) = 0, a line pair 


/i\ * 4 , y* . 

(i) 55 + !ooS 


■ 1, an ellipse 
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9. (q) 5 points determine a conic 



X 2 

*7 

7 2 

X 

y 

1 

k*l* 

XI yi 

71 2 

XI 

yi 

1 

X2 2 

X2^2 

72 2 

X2 

yz 

1 

<xa* 

X3J>3 

73 2 

X3 

yz 

1 

X4 2 

X4^4 

74 s 

X4 

74 

1 

X5 2 

X5y5 

75 2 

X5 

75 

1 


= 0 


(*) xy ,3a: = 9, a hyperbola. 

10. Nine points'determine a cubic curve. 

The top row of the required determinant is 

x 3 x 2 y xy 2 y 3 x 2 xy y 2 x y 1 

11. Nine joints determine a quadric surface. 

The top of the required determinant is 

■v-2 


xy yz zx x y z 1 


12 . 


x y 
xx yi 


X2 

X3 


yz 

ys 


= o 


1 x 4 74 

The four fixed points must not lie in a plane. 


Exercise 4.1 

i. y = -3x-2;y= —f-f 


2. ot = p + — where « = 0,1,2 . . 

- (i!) 

4. 55 k - 55; * = 

i) 

6. Ml = 3 A-' = j 

/-a v -2\ 

1-2 1 

l § -i 0^ 

12. 1 

Exercise 4.2 

n 0 0 \ 

2. (i) 0 0 1 

\0 1 0/ 



6. Tr C = 1 + 2 cos 


Exercise 5.1 

The answers are given using the theorem on p. 125. 


1. Ai = 7, A 2 = 2; k n = i(7 n - 2«) 

2. Ax = I, A 2 = -i; = | (l - (- 5 )“). As n 


3. Ai = A 2 = i; 

4. 2i = 5, X 2 — 2; 

5. Ai = A 2 = 5; 

6 . Ai = X 2 = 4; 

7. Ai = A 2 = 2; 

8. Ai = A 2 = — 1; 


*« - 3((i)« - ©») 
kn - 4(5* - 2*) 
kn = n5 n_1 

kn = M*" 1 
kn = «2 rt_1 
*ft = /l(-l)*- 1 





Answers 


249 


Exercise 5.3 


1 . and is independent of p. 

2. (i) 0*536 (ii) Probability of a breakdown 

(iii) Probability of correct storage -► J. 

(a) 0*59049 (b) 0*37928 

3. 0*375; 0*1765 4. 0*24; 0*072 

5 . (i) i (ii) a 6. ai = az = tfs — £ 

k 

SRC 


S 

k + l R 
C 



1. 



Exercise 6.1 

There are a variety of possibilities for each answer. 

1 . wy' 4* w'yz 4* xyz 

2 . z' + x'y' + wx 

3. xy + x'wz + x'y'wz' 

4 . *yz' 4- x'y'w' 4- y'zw' 4- xz'w + x'yzw 

5. yV 4* xw'z' + x'y'w 

6 . x'zw 4* xyzw 4* x'y'z'w' + *y'*w' + xy'z'w 

7. >v + y'z 4* x'y'z' 

8 . xy 4- w'y 4* w'z 


Exercise 6.2 

1. 


(0 

0 

1 


(« 


0 

0 

°) 

ll 

1 

0 

o) 

lo 

1 

1 

lj 

(0 

0 

0 

o\ 

W) 

a 

0 

1 

n 

(o 

1 

0 

oj 

li 

1 

1 

ii 


-x'—y —z- 


i M (1 

Mi 

Mi 

w(J 


3. (a) 


W 


4. (a) 


1 1 
1 0 
1 1 
0 0 
1 1 
1 0 
1 1 
0 0 
0 1 
1 1 
1 1 
0 1 
0 1 
1 1 
1 1 
0 1 
1 1 
0 1 
0 1 
1 1 


3 

2 ) 

2 ) 

2 ) 

1\ 


<» (2! i 2) 
»( 2 ! 2 2 ) 
Mi ! 2 2) 
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1 

0 

1 

0 

0 


0 l \ 
1 1 

0 1 / 

1 l \ 
1 1 

1 0 / 


3. Yes 


/0 0 1 0 \ 
r/A 1 ° 0 0 °\ 

lo 0 1 1/ 

\l 0 0 1/ 

Exercise 7.1 

. . . (0 1 1 1\ 

L (o 0 0 o) 

/o 1 1 1 1 0 

2. (0 1 1 0 1 0 

\1 0 0 0 1 0 

. . . (1 0 I 0\ 

4 - (a) (l 1 0 o) 

/I 0 0 

5. 0 1 0 

\1 1 0 

«■ <«> (S) 

-0 

w (a 

Exercise 7.2 

U <*** (o)’ (l)’ ({). (l) repeating 

(§)•(“)•(?)•(■)•(?) 


»(i i ?!) 


«c:: i) 

Mi)-Ml) 

«.©•({)•«)•(!) 

/° 

OT i 


»(?) 




2. Cycle 


and repeating. 


3. Cycle 10 



(?) 


A /O 

01 repeating to 10 

.0/ U 


4. Cycle (j)» (}) and repeating. 


Exercise 8.1 

/O 3 7 8 9^ 

3 0 4 5 6 

3. I 7 4 0 2 3 

8 5 2 0 2 

\9 6 3 2 oy 


4. acdefg 17 


7. A 


1 2 3 4 5 

a /I 0 1 1 OV 

.MO 0 0 1 l\ 

clo 1101r 
d \1 1 0 0 0 / 




Answers 


“ P Y 



9. abdef 12 10. adefgh 15 

11. Truro-Okehampton-Salisbury 166 miles 
14. a 




Exercise 9.1 

1. Un = 2* 

3. Un = 2" + 1 


6. Un 



3V7 


)(i 



+ 


2. Kn = 2* 

4. Un = 2.4" + (-3)" 


Exercise 9.2 

1. x = (B - A)e* + (2A - B)e‘ 

2. x = (B-3A)te? t + Ae? t 

D 

3. jc=2 sin2/ + ^ cos2/ 


Exercise 9.3 

1. x = it 2 + £ * + ii 

Exercise 9.4 
1.x- e%4/ + B) 

Exercise 9.5 

1 . * « e 2e (/ 2 + 44/ + *) 

3. x^e^ + eH-i*+ 70 


2. x = iMcosf — sinO 

2. x = er**(Ct + D) 

2. x - e*(*/ 2 + O + I>) 

4. x = f(G sin 2/ + tfcos 2/) 


Exercise 9.6 


1. Ai-7 V 2+ V2 2a > 

2a - j V2- V2 24 ■ 

Solution as in worked example. 

/ cos «)0 —sin c»0 \ 

\— sin a)0 cos 0)0/ 

Exercise 9.10 

1. Strictly determined. Value 3. 

3. Not strictly determined. 

5. Strictly determined. Value 0. 

Exercise 9.11 
l. pi = f;?i = f;*> 

3. /u — t; #1 = t; o = 7f 
5. = f; ox — t;» = f 

7. pi = f; o = f 

8 . = ff; f? « ft. He should buy the 

Exercise 9.12 

1. 3 2. 3 


-y V 2 + V2 
V 2 W! 


2. Not strictly determined. 
4. Not strictly determined. 
6 . Not strictly determined. 


2. pi = i; qx = i; o = 44 

4. Strictly determined. 0 = 3 
6 . Strictly determined, v — 4 

chocolates regardless I 

3. 3 4. 7 



This hook is intended as an extension and continuation of 
Matrices 1 and Matrices 2 (both by G Matthews in the 
Contemporary SchQQi Mathematics series), at sixth tram 
level. The book contains a short revision of '0 level matrix 
work but is, in fact, completely self-contained. The first 
section is an investigation of some aspects of matrix 
algebra and covers all the known A level syllabuses. The 
purpose of this section is to provide the mathematical 
tools, which will be of important use later on when deal mg 
with the applications. The second section gives an insight 
into some of the many and varied applications of matrices 
It is impossible to include a complete list of applications 
as this would be too large, but the selection given has been 
taken from as wide a field as possible and is of the greatest 
interest. 
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